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interest, particularly binding affinity, where Che products may be detached from the particle or retained on the particle. The reac- 
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COKPLEX COKBINATORXAL CHEKICAL LIBR2URIES ENCODED WITH TAGS 

This application is a continuation-in-part of U»S* Serial 
No. 08/013,948, filed February 4, 1993, which is a 
continuation-in-part of U.S. Serial No. 07/955,371, filed 
5 October 1, 1992, the contents of both of which are hereby 
incorporated by reference into the subject application. 

Introduction 
Technical Field 

10 The field of this invention concerns combinatorial 
chemistry which involves syntheses having a plurality of 
stages, with each stage involving a plurality of choices, 
where large numbers of products having varying 
compositions are obtained. 

15 

Background of the Invention 

There is substantial interest in devising facile methods 
for the synthesis of large nximbers of diverse compounds 
which can then be screened for various possible 

20 physiological or other activities- Typically such a 
synthesis involves successive stages, each of which 
involves a chemical modification of the then existing 
molecule. For example, the chemical modification may 
involve the addition of a unit, e,g* a monomer or synthon, 

25 to a growing sequence or modification of a functional 
group* By employing syntheses where the chemical 
modification involves the addition of units, such as amino 
acids, nucleotides, sugars, lipids, or heterocyclic 
compounds where the units may be naturally-occurring, 

30 synthetic, or combinations thereof. one may create a 
large number of compounds. Thus, even if one restricted 
the synthesis to naturally-occurring units or building 
blocks, the number of choices would be very large, 4 in 
the case of nucleotides, 20 in the case of the common 

35 amino acids, and essentially an unlimited number in the 
case of sugars. 
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One disadvantage heretofore inherent in the production of 
large number of diverse compounds, where at each stage of 
the synthesis there are a significant number of choices, 
is the fact that each individual compound will be present 
5 in a minute amount. While a characteristic of a 
particular compound, e.g. a physiological activity, may be 
determinable, it is usually impossible to identify the 
chemical structure of this particular compound present. 

10 Moreover, physiologically-active compounds have 
historically been discovered by assaying crude broths 
using Edisonian or stochastic techniques, where only a 
relatively few compounds are assayed at a time, or where 
a limited number of structural similar homologs of 

15 naturally-occurring physiologically-active compounds are 
assayed. Two of the major problems has been associated 
with the use of such crude broths, namely, the necessity 
to purify the reaction mixture into individual component 
compounds and the time-consuming effort required to 

20 establish the structure of the compound once purified. 

To address these disadvantages and problems, techniques 
have been developed in which one adds individual units as 
part of a chemical synthesis sequentially, either in a 

25 controlled or a random manner, to produce all or a 
substantial proportion of the possible compounds which can 
result from the different choices possible at each 
sequential stage in the synthesis. However, for these 
techniques to be successful it is necessary for the 

30 compounds made by them to be amenable to methods which 
will allow one to determine the composition of a 
particular compound so made which shows a characteristic 
of interest. 

35 One such approach involves using a chip which allows for 
separate analysis at physically separate sites on the 
surface of the chip (Fodor et al., Science 251: 767 



wo 94/08051 



-3- 



PCT/US93/09345 



[1991]). By knowing what reactant is added sequentially 
at each such site^ one can record the sequence of events 
and thus the series of reactions • If one then subjects 
the chip to a screening method for a particular desired 
5 characteristic and detects the characteristic one can 
really determine the compound synthesized at the site 
which demonstrates that characteristic. 

Another such technique involves the theoretical synthesis 
10 of oligonucleotides in parallel with the synthesis of 
oligopeptides as the compounds of interest (Brenner and 
Lerner, PNAS USA [1992] £1: 5381-5383). 

Further techniques are also disclosed in the following 
15 publications: Amoto, Science (1992) 257 . 330-331 
discusses the use of cosynthesized DNA labels to identify 
polypeptides. Lam, et al.. Nature (1991) 354 , 82-84 
describe a method for making large peptide libraries. 
Houghton, et al*. Nature (1991) 354, 84-86 and Jung and 
20 Beck-Sickinger, Angew. Chem* Int. Ed. Engl. (1992) 91, 
367-383 describe methodology for making large peptide 
libraries. Kerr et al . , J. Amer. Chem. Soc. , (1993) 115, 
2529-31 teach a method of synthesizing oligomer libraries 
encoded by peptide chains. 

25 

However, since methods such as the preceding typically 
require the additum of like moieties, there is substantial 
interest in discovering methods for producing compounds 
which are not limited to sequential addition of like 

30 moieties. Such methods would find application, for 
example, in the modification of steroids, antibiotics, 
sugars, coenzymes, enzyme inhibitors, ligands and the 
like, which frequently involve a multi-stage synthesis in 
which one would wish to vary the reagents and/or 

35 conditions to provide a variety of compounds. in such 
methods the reagents may be organic or inorganic reagents, 
where functionalities may be introduced or modified, side 
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groups attached or removed, rings opened or closed, 
stereochemistry changed, and the like* (See, for example, 
Bunin and Ellman, JACS 114, 10997 [19923 0 For such a 
method to be viable, however, there needs to be a 
5 convenient way to identify the structures of the large 
number of compounds which result from a wide variety of 
different modifications. Thus, there is a need to find 
a way whereby the reaction history may be recorded, and 
desirably, the structures of the results compound 
10 identifed* 

Finally as the size of a library compounds so synthesizd 
increases, known techniques of structure elucidation and 
product segregation introduce substantial inefficiencies 

15 and uncertainties which hinder the accurate determination 
of the structure of any compound identified as being of 
interest* Thus, there is a substantial need for new 
methods which will peinnit the synthesis of complex 
combinatorial chemcial libraries which readily permit 

20 accurate structural determination of individual compounds 
within the library which are identified as being of 
interest. 

Finally, international applications W091/17823 and 
25 WO92/09300 concern combinatorial libraries. 

Many of the disadvatnages of the previously-described 
methods as well as many of the needs not met by them are 
addressed by the present invention which, as described 
30 more fully hereinafter, provides marlad advantages over 
these previously-described methods. 
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Summarv of The Invention 

Methods and compositions are provided for encoded 
combinatorial chemistry, whereby at each stage of the 
synthesis, a support such as a particle upon which a 
5 compound is being synthesized is uniquely tagged to define 
a particular event, usually chemical, associated with the 
synthesis of the compound on the support. The tagging is 
accomplished using identifier molecules which record the 
sequential events to which the supporting particle is 
10 exposed during synthesis, thus providing a reaction 
history for the compound produced on the support. 

Each identifier molecule is characterized by being stable 
under the synthetic conditions employed, by remaining 

15 associated with the supports during the stage of the 
synthesis, by uniquely defining a particular event during 
the synthesis which reflects a particular reaction choice 
at a given stage of the synthesis, by being 
distinguishable from other components that may be present 

20 during assaying, and by allowing for detachment of a tag 
component which is discernible by a convenient, analytical 
technique • 



The identifiers of this invention are used in combination 
25 with one another to form a binary or higher order encoding 
system permitting a relatively small number of identifiers 
to be used to encode a relatively large number of reaction 
products. For example, when used in a binary code N 
identifiers can uniquely encode up to 2** different 
30 compounds. 

Moreover, the identifiers of this invention need not be 
bound serially through a previous identifier but rather 
are individually bound to the substrate, either directly 
35 or through the product being synthesized. The identifiers 
are not sequencable. Furthermore, the identifiers contain 
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a cleavable member or moiety which permits detachment of 
a tag component which can be readily analyzed. 

Conveniently, the combinatorial synthesis employs 
5 definable solid supports upon which reactions are 
performed and to which the identifiers are bound. The 
individual solid supports or substrates or substrates 
carrying the final product compounds may be screened for 
a characteristic of interest and the reaction history 
10 determined by analyzing the associated identifier tags. 
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DETATLED DESCRIPTION OF THE INVENTION 



As used in this application the term "tag" or "T" means a 
chemical moiety which possesses two properties. First, it 
5 is capedDle of being distinguished from all other chemical 
moieties. Second, it is capable of being detected when 
present at 10*^® to 10*' mole. These two properties may be 
embodied in a single chemical structure. Alternatively, 
these properties may be embodied in separate chemical 

10 structures which are linked together. In this latter 
case, one of the chemical structures, which may be 
designated C (or in the case of more than one such 
structure C, C, etc.) provides the property of rendering 
the tag distinguishable from other tags while the other 

15 chemical structure, E, provides the property of rendering 
the tag detectable and optionally may provide the property 
of rendering the tag separable from other tags. 

As used in this application, the term "linker" or "L" 

20 means a chemical moiety which possesses three properties. 
First, it is attachable to a solid support. Second, it is 
attachable to a tag. Third, when it is attached to both 
a solid support and a tag, it is cleavable such that the 
tag may be released from the solid support. These three 

25 properties may be embodied in a single chemical structure. 
Alternatively, these properties are embodied in three 
chemical structures which are linked together. In this 
latter case one of the chemical structures, which may be 
designated f\ provides the property of rendering the 

30 linker attachable to the solid support; the second 
chemical structure, which may be designated V, provides 
the property of rendering the linker cleavable; and the 
third chemical structure which may be designed A', 
provides the property of rendering the linker attachable 

35 to the tag. Desirably, the chemical structures V and A' 
are one and the same, in which case V-A' may be designated 
f2. 
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As used in this application, the term "identifier" means 
a chemical entity which includes both a tag and a linker. 
ThuSr in the broadest sense an identifier may be 
represented by the formula L-T while specific embodiments 
5 of the identifier may be represented by the formulae F'-V- 
A'-T; F'-V-A'-C-E (or F'-V-A'-E-C) ; L-C-E (or L-E-C) ; and 
L-C-E-Cl. 

As used in this application, the term "bound identifier" 
10 means an identifier attached to a solid support. 

As used herein, the term "choice" means the alternative 
variables for a given stage in a combinatorial synthesis, 
such as reactant, reagent, reaction conditions, and 
15 combinations thereof. Where the term "stage" corresponds 
to a step in the sequential synthesis of a compound or 
ligand; the compound or ligand being the final product of 
a combinatorial synthesis. 

20 The term "alkyl" includes linear, branched, and cyclic 
structures and combinations thereof. Thus, the term 
includes methyl, ethyl, propyl, isopropyl, butyl, sec- and 
tert-butyl , cyclopropyl , cyclobutyl , cyclopenty 1 , 2 - 
methylcyclopropy 1 , and the like. Lower alkyl is C^-C^ 

25 alkyl. Lower alkenyl is C^-C^ alkenyl of a linear, 
branched, or cyclic configuration and combinations 
thereof. 

Unless otherwise indicated, it is intended that the 
30 definitions of any siibstituent ^ e.g. . R^, Z, etc.) in a 
particular molecule be independent of its definitions 
elsewhere in the molecule. Thus, NR*R* represents NHH, 
NHCH3, NHCH2CH3, N(CH3)2, etc. 

35 Some of the compounds described herein contain one or more 
centers of asymmetry and may thus give rise to 
enantiomers, diastereoisomers, and other stero isomeric 
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forms- The present invention is meant to include all such 
possible stereoisomers as well as their racemic and 
optically pure forms. Optically active (R) and (S) 
isomers may be prepared using chiral synthons, chiral 
5 reagents, or resolved using conventional techniques • When 
the compounds described herein contain olefinic doixble 
bonds, it is intended to include both E and Z geometric 
isomers* 

10 The materials upon which the combinatorial syntheses of 
this invention are performed are referred to herein 
interchangeably as beads, solid surfaces, (solid) 
substrates, particles, supports, etc. These terms are 
intended to include: 
15 a) solid supports such as beads, pellets, disks, 

capillaries, hollow fibers, needles, solid 
fibers, cellulose beads, pore-glass beads, 
silica gels, polystyrene beads optionally cross- 
linked with divinylbenzene, grafted co-poly 
20 beads, poly-acrylamide beads, latex beads, 

dimethyl aery 1 amide beads optionally cross-liziked 
with N,N'-bis-acryloyl ethylene diamine, glass 
particles coated with a hydrophobic polymer, 
etc., i.e., a material having a rigid or semi- 
25 rigid surface; and 

b) soluble supports such as low molecular weight 
non-cross-linked polystyrene. 

These materials must contain functionalities or must be 
30 able to be functional ized such that identifiers or product 
intermediates may be attached to them. 

In addition, the following abbreviations have the 

indicated meanings: 
35 AcOH - acetic acid 

BS A « bis ( tr imethy Is i ly 1 ) ace t amide 

CAN = cerium (iv) ammonium nitrate 
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DEAD 




diethylazodicarboxylate 


DCM 




dichlorome thane 


Die 




diisopropy 1 carbod iiiaide 


DMF 




N , N-dimethyl f ormamide 


Fmoc 




9-fluorenylmethoxycarbonyl 


HOBt 




l-hydroxybenzotriazole 


PhMe 




toluene 


r.t. 




room temperature 


TFA 




trifluoroacetic acid 


THF 




tetrahydrofuran 



The subject invention concerns the production of libraries 
of products, i.e. compounds, where the individual products 
or compounds present in the libraries may be physically 
15 separated from one another and may be screened for a 
characteristic of interest either bound to, or detached 
from, a solid support. By having serial syntheses, where 
at each stage of a synthesis each of the individual 
intermediates is treated in a variety of ways, a very 
20 large number of products is produced, each of which is 
present in a small amount, frequently less than 100 pmol^ 
more frequently less than 10 nmol. Because of the small 
quantity of final product or compound so produced, 
identifying these products by isolating and structurally 
25 elucidating the products would generally not be feasible. 
Moreover, in sequential synthesis involving other than the 
addition of similar units, the analysis would be arduous 
if not impossible using the amount of product typically 
available. However, by associating each stage or 
30 combination of stages (e.g., "add reagent A" or "add 
reagent A, then reagent B, and heat to 100*C for 2 hrs.") 
of the serial synthesis with an identifier which defines 
the choice of variables such as reactant, reagent, 
reaction conditions, or a combination of these, one can 
35 use the identifiers to define the reaction history of each 
definable and separable substrate. The analysis of tags 
detached from the identifiers allows for ready 
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identification of the reaction history, at picomolar or 
lower concentrations, e.g. femtomolar or less* One can 
determine a characteristic of a product of a synthesis, 
usually a chemical or biological characteristic by varius 
5 screening techniques, and then identify the reaction 
history and thereby the structure of that product, which 
has the desired characteristic, by virtue of the tags 
associated with the product. 

10 The use of the instant multiple tag system avoids the 
necessity of carrying out a complicated cosynthesis which 
reduces yields and requires multiple protecting groups, 
and avoids the necessity of using sequencable tags which 
are necessarily chemically labile. Both the necessity of 

15 multiple protecting groups and the intrinsic instability 
of all known sequenccible tagging molecules (i.e., nucleic 
acid or peptide oligomers) severely limit the chemistry 
which may be used in the synthesis of the library element 
or ligand. 

20 

Moreover, the use of a binary, or higher, multiple tag 
system reduces enormously the niimber of tags necessary to 
encode the reagent/reactant choice in any stage in a 
synthesis. For example, if a particular synthetic stage 

25 could be carried with 125 different choices for reagent, 
the binary system would require only 7 tags. This can 
make the difference between a practical encoding system 
and an impractical one, because it may not be feasible to 
obtain and use the large number of distinguishable tags 

30 required by other systems. With the binary system of the 
invention, 30 distinguishable tags are available and are 
sufficient to encode >10' different syntheses. 

Importantly, the present method employs tags which are 
35 detachable from a ligand or compound synthesized also for 
the purpose of decoding. Such detachability also allows 
the tags to be distinguished on more than one basis; in 
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particular, they can be separated ( e.g. . on the basis of 
chromatographic retention time) and then analyzed r e,a, , 
a second basis is a spectral property such as mass 
spectroscopy m/e, or electrophoricity) • Having multiple 
5 bases for distinction allows the encoding of large amounts 
of information with a small number of tags. 

Detachment further allows tags to be detected at very low 
levels, because they can be removed from the support 
10 matrix on which the synthesis is effected and from the 
ligand synthesized, the presence of either of which could 
provide spurious background signals, e.g. by quenching 
fluorescence or the like. 

15 Detachable tags are also amenable to rapid analysis by 
automated sampling systems, and allow for selective 
derivatization for detection via functional groups, 
eliminating any incompatibility between the detection 
moiety and the reaction conditions used in the synthesis. 

'20 

Inherent in any tagging scheme is the requirement that the 
chemical characteristics of the tags and the chemical 
stages for their incoiroration be compatible with the 
characteristics of the ligand and the stages in their 

25 synthesis, and vice versa. The advantage of tags that are 
generally unreactive, as exemplified hereinafter by the 
substituted- aryloxypolymethylene moieties, is a greater 
range of chemical transformations and chemical 
functionality that can be employed in synthesis of the 

30 ligands. 

A further advantage of the chemically stable tags of this 
invention is their compatibility with a greater variety of 
rapid, convenient methods of separation and analysis, such 
35 as gas chromatography and mass spectrometry. Moreover, 
the organic tags of these inventions generally do not give 
specifically interact with biological receptors. Thus, 
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then tags will generally not give spurious results in 
biological assays and will generally not be modified by 
enzymes or other biological molecules. 

5 Finally, the chemical stsODXlity of the present tags allows 
them to be detached by a wide variety of methods which 
improves sensitivity in their analysis as described above. 

Thus, this invention provides methods and compositions for 
10 encoded combinatorial synthesis whereby at each stage of 
the synthesis one or more identifiers are provided which 
encode an event associated with a particle stage in the 
synthesis of a compound on a support or particle. This 
event comprises the choice of reactant and/or reaction 
15 conditions at that stage of the reactions where each such 
stage may involve one or more reactants which are the same 
or different under the same or different conditions, e.g. 
partial reactions, multiple additions, rate of addition, 
differing combinations of reagents, etc. In addition, 
20 groups of particles may be sequestered from other groups 
of particles and subjected to a different series of events 
at any time during the course of the sequential synthesis • 

By providing N identifiers, each having M distinguishable 
25 states, different syntheses can be uniquely defined. In 
the case of M=2 where the two states could be the presence 
or absence of identifier, the synthesis would thus be 
defined by a base 2 or binary code. In the case of M=3 
where the three states could be the presence of an 
30 identifier at two distinguishable concentrations or its 
absence, the synthesis would be defined by a base 3 code. 
Herein, such base M codes where M>2 are termed higher 
order codes. The advantage of higher order codes over a 
binary code is that fewer identifiers are required to 
35 encode the same quantity of information about the 
synthesis. The products which are produced will be 
defined as resulting from a serial synthesis. At each 
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stage in the synthesis, there is available a plurality of 
reactants and/or reagents and/ or conditions, which result 
in a feature of the product in relation to an identifiable 
and usually separable entity, e.g. tag. In referring to 
5 reactants and reagents, it is intended that the reactant, 
for the fflost part, becomes incorporated into the product, 
e.g. an amino acid, nucleotide, nucleophile, electrophile, 
diene, alkylating or acylating agent, diamine, or any 
other synthon, etc. while a reagent may or may not become 

10 incorporated into the product, e.g. base, acid, heat, 
oxidizing or reducing agent, while both will be included 
under the term "agent". The synthesis may involve 
individual reactants which become incorporated into the 
product. Alternatively, a stage may involve one or more 

15 reactions which result in a modification of a reaction 
intermediate. In many cases, combinations of these 
possibilities will be involved. 

Using a base 2 or binary code (M=2) and three identifiers 
(N=3) , as many as 8 (2^) agents for a given stage in a 
20 synthesis may be encoded. If the three identifiers are 
represented as Tl, T2, and T3 and the presence or absence 
of each identifier is represented as a '0' or '1' 
respectively, then eight different agents could be 
represented in a binary code as follows: 





Agent 1 


Agent 2 


Agent 3 


1 

Agent 4 




0,0,0 


1,0,0 


0,1,0 


1,1,0 




Agent 5 


Agent 6 


Agent 7 


Agent 8 


T1,T2,T3 


0,0,1 


1,0,1 


0,1,1 


1,1,1 



30 Similarly, even more information about the synthesis may 
be encoded by more identifiers. For example, 9 
identifiers (N=3) and a base 2 code (M=2) would allow up 
to 2^ or 512 different agent choices to be encoded. Using 
a base 3 code (M=3) and three identifiers (N=3) would 

35 allow as many as 27 (3^) agent choices to be encoded. If 
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the three identifiers are represented as Tl, T2 and T3, 
and the absence of an identifier is represented as a '0', 
its presence at a quantity of -0-5 pmol/bead as a '1', and 
its presence of a quantity of -1.0 pmol/bead as a '2', 
5 then the 27 different agents could be represented by three 
identifiers in base 3 code as: 





Agent 1 


Agent 2 


Agent 3 


Agent 4 | 


T1,T2,T3 


0,0,0 


1,0,0 


2,0,0 


0,1,0 




Agent 5 


Agent 6 


• • • 


Agent 27 


T1,T2,T3 


1,1,0 


2,1,0 


• • • 


2,2,2 1 



To make such higher order encoding schemes practical, one 
additional identifier at a given quantity (e.g., -l.o 

15 pmol/bead) would be added to all members of the library to 
provide a standard against which the quantities of all 
identifiers would be measured. The quantities of the 
identifiers could be measured by gas chromatography or 
HPLC with a variety of detection methods. In the case of 

20 HPLC, quantities could be conveniently measured by 
scintillation counting if the identifiers were 
radioactively labeled by different quantities of a 
radionuclide such as tritium (hi) . It would be particularly 
convenient to carry out the quantitation by measuring the 

25 ^H-to-^^C ratio, thus using ^^C as a standard. In this way, 
as many as ten quantities of could be distinguished to 
create a base 10 or decimal code (M=10) which could encode 
enormous amounts of information with very few identifiers. 



30 
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Products and Synthetic Strategies 

For the most part, the products of the method of this 
invention will be organic compounds and removal of 
chemical units, reactions involving the modification or 
5 introduction of one or more functionalities, ring 
openings, ring closings, etc. Chemical units can take 
many forms, both naturally-occurring and synthetic, such 
as nucleophiles, electrophiles, dienes, alkylating or 
acylating agents, diamines, nucleotides, amino acids, 
10 sugars, lipids, or derivatives thereof, organic monomers, 
synthons, and combinations thereof. Alternatively, 
reactions may be involved which result in alkylation, 
acylation, nitration, halogenation, oxidation, reduction, 
hydrolysis, substitution, elimination, addition, and the 
15 like« This process can produce non-oligomers, oligomers, 
or combinations thereof in extremely small amounts, where 
the reaction history, and composition in appropriate 
cases, can be defined by the present tags. Non-oligomers 
include a wide variety of organic molecules, e.g. 
20 heterocyclics, aromatics, alicyclics, aliphatics and 
combinations thereof, comprising steroids, antibiotics, 
enzyme inhibitors, ligands, hormones, drugs, alkaloids, 
opioids, terpenes, porphyrins, toxins, catalysts, as well 
as combinations thereof • Oligomers include oligopeptides, 
25 oligonucleotides, oligosaccharides, polylipids, 
polyesters, polyamides, polyurethanes , polyureas, 
polyethers, poly (phosphorus derivatives) e.g. phosphates, 
phosphonates , phosphoramides, phosphonamides , phosphites, 
phosphinamides, etc., poly (sulfur derivatives) e.g. 
30 sulfones, sulfonates, sulfites, sulfonamides, 
sulf enamides, etc. , where for the phosphorous and sulfur 
derivatives the indicated heteroatom for the most part 
will be bonded to C, H, N, O or S, and combinations 
thereof. 

35 

Reactions may involve modifications at a variety of random 
sites of a central core molecular structure or 
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modifications at a specific site. For example, one may 
brominate a polycyclic compound, where bromination may 
occur at a plurality of sites or use a brominating agent 
which will be specific for a particular site, e.g., N- 
5 bromosuccinimide. For the most part, reactions will 
involve single sites or equivalent sites, for example, one 
of two hydroxyl groups of a glycol. 

For the most part, the subject synthesis will have at 
10 least two stages where other than bifunctional compounds 
are attached using the same linking functionality, e.g. 
amino acids and amide bonds, nucleotides and phosphate 
ester bonds, or mimetic compounds thereof, e.g., amilioiso- 
cyanates and urea bonds. 

15 

The methods of the invention permit the variation in 
reaction at each stage, depending on the choice of agents 
and conditions involved. Thus, for amino acids, one may 
have up to 20 amino acids involved using the common 

20 naturally-encoded amino acids and a much wider choice, if 
one wishes to use other amino acids, such as D-amino 
acids, amino acids having the amino group at other than 
the a-position, amino acids having different substituents 
on the side chain or substituents on the amino group, and 

25 the like. For the different nucleic acids, there will 
usually be up to 4 natural nucleic acids used for either 
DNA or RNA and a much larger number is available if one 
does not choose to use those particular nucleic acids. 
For the sugars and lipids, there are a very large number 

30 of different compounds, which compounds may be further 
increased by various substitutions, where all of these 
compounds may be used in the synthesis. For individual 
organic compounds the choice may be astronomically large. 
In addition, one may have mimetic analogs, where ureas, 

35 urethanes, carbonylmethylene groups, and the like may 
substitute for the peptide linkage; various organic and 
inorganic groups may substitute for the phosphate linkage; 
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and nitrogen or sulfur may substitute for oxygen in an 
ether linkage or vice versa. 

The synthetic strategies will vary with the nature of the 
5 group of products one wishes to produce. Thus, the 
strategy must take into consideration the ability to 
stage-wise change the nature of the product, while 
allowing for retention of the results of the previous 
stages and anticipating needs for the future stages. 
10 Where the various units are of the same family, such as 
nucleotides, emino acids and sugars, the synthetic 
strategies are relatively well-established and frequently 
conventional chemistry will be available. Thus, for 
nucleotides, phosphoramidite or phosphite chemistries may 
15 be employed; for oligopeptides, Fmoc or Boc chemistries 
may be employed where conventional protective groups are 
used; for sugars, the strategies may be less conventional, 
but a large number of protective groups, reactive 
functionalities, and conditions have been established for 
20 the synthesis of polysaccharides. For other types of 
chemistries, one will look to the nature of the individual 
unit and either synthetic opportunities will be known or 
will be devised, as appropriate. 

25 In some instances, one may wish to have the same or 
different blocks introduced at the same or different 
stages. For example, one may wish to have a common 
peptide functional unit, e.g. the fibronectin binding unit 
(RGDS) , a polysaccharide, e.g. Le'^, an organic group, e.g. 

30 a lactam, lactone, benzene ring, olefin, glycol, 
thioether, etc. introduced during the synthesis. In this 
manner one may achieve a molecular context into which the 
variation is introduced. These situations may involve 
only a few stages having the plurality of choices, where 

35 a large number of products are produced in relation to a 
particular functional entity. This could have particular 
application where one is interested in a large number of 
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derivatives related to a core molecule or unit known to 
have a characteristic of interest. 

In developing synthetic strategies, one can provide for 
5 batch synthesis of a few compounds which would be prepared 
during the course of the coiRbinatorial synthesis. By 
taking extreme examples, for example, syntheses which 
might involve steric hindrance, charge and/or dipole 
interactions, alternative reaction pathways, or the like, 

10 one can optimize conditions to provide for enhanced yields 
of compounds which might not otherwise be formed or be 
formed only in low yield. In this manner, one may allow 
for a variety of reaction conditions during the 
combinatorial synthesis, involving differences in solvent, 

15 temperatures, times, concentrations, and the like. 
Furthermore, one may use the batch syntheses, which will 
provide much higher concentrations of particular products 
than the combinatorial synthesis, to develop assays to 
characterize the activity of the compounds. 

20 

Supports: Attachment and Detachment 

The synthetic protocol requires that one provide for a 
plurality of different reactions involving different 
reactants resulting in a plurality of different 

25 intermediates at each stage of the synthesis. While other 
techniques are available, this can be achieved most 
conveniently by employing small definable solid 
substrates, commercially available as beads, which can be 
readily mixed, separated, and serve as a solid substrate 

30 for the sequential synthesis. The solid substrates may be 
solid, porous, deformable or hard, and have any convenient 
structure and shape. In some instances, magnetic or 
fluorescent beads may be useful. The beads will generally 
be at least 10-2000 fim, usually at least 20-500 ^m, more 

35 usually at least 50-250 ^,m in diameter. 



wo 94/08051 



PCr/US93/09345 



-20- 

Any convenient composition can be used for the particles 
or beads, which bead composition will maintain its 
mechanical integrity during the various process stages, 
can be functional ized, has functional groups or allows for 
5 reaction with an active species, allows for the serial 
synthesis as well as attachment of the identifiers, can be 
readily mixed and separated, and will allow for convenient 
detachment of the tags and products. Beads which may be 
employed include cellulose beads, pore-glass beads, silica 
10 gel, polystyrene beads, particularly polystyrene beads 
cross-linked with divinylbenzene , grafted co-polymer beads 
such as polyethyleneglycol/polystyrene, polyacrylamide 
beads, latex beads, dimethylacrylamide beads, particularly 
cross-linked with N,N'-bis-acryloyl ethylene diamine and 
15 comprising N-t-butoxycarbonyl-/9-alanyl-N'-acryloyl 
hexamethylene diamine, composites, such as glass particles 
coated with a hydrophobic polymer such as cross-linked 
polystyrene or a fluorinated ethylene polymer to which is 
grafted linear polystyrene; and the like. General reviews 
20 of useful solid supports (particles) that include a 
covalently-linked reactive functionality may be found in 
Atherton, et al., Prospectives in Peptide Chemistry ^ 
Karger, 101-117 (1981); Amamath, et al., Chem. Rev. 
77:183-217 (1977); and Fridkin, The Peptides . Vol. 2, 
25 Chapter 3, Academic Press, Inc., (1979), pp. 333-363. 

Depending upon the nature of the synthetic procedure or 
the assay of the final product, one or another bead may be 
more or less desirable. While beads are especially 
30 convenient, other solid supports may also find use, such 
as capillaries, hollow fibers, needles, solid fibers, 
etc. , where the size of the solid support allows for the 
desired variation in reaction histories. 

35 Depending upon the nature of the synthesis, the beads may 
be functionalized in a variety of ways to allow for 
attachment of the initial reactant. These may be linked 
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through a non-labile linkage such as an ester bond, amide 
bond, amine bond, ether bond, or through a sulfur, 
silicon, or carbon atom, depending upon whether one wishes 
to be able to remove the product from the bead, 
5 Conveniently, the bond to the bead may be permanent, but 
a linker between the bead and the product may be provided 
which is cleavable such as exemplified in Table 1. Two or 
more different linkages may be employed to allow for 
differential release of tags and/or products. 

10 

Depending upon the nature of the linking group bound to 
the particle, reactive functionalities on the bead may not 
be necessary where the manner of linking allows for 
insertion into single or double bonds, such as is 
15 available with carbenes and nitrenes or other highly- 
reactive species. In this case, the cleavable lin3cage 
will be provided in the linking group which joins the 
product or the tag to the bead* 

Desirably, when the product is permanently attached, the 
link to the bead will be extended, so that the bead will 
not sterically interfere with the binding of the product 
during screening. Various links may be employed, 
particular hydrophilic links, such as polyethyleneoxy, 
saccharide, polyol, esters, amides, combinations thereof, 
and the like* 

Functionalities present on the bead may include hydroxy, 
carboxy, iminohalide, amino, thio, active halogen (CI or 
30 Br) or pseudohalogen (e.g., -CFj, -CN, etc.), carbonyl, 
silyl, tosyl, mesylates, brosylates, triflates or the 
like- In selecting the functionality, some consideration 
should be given to the fact that the identifiers will 
usually also become bound to the bead. Consideration will 
35 include whether the same or a different functionality 
should be associated with the product and the identifier, 
as well as whether the two functionalities will be 



20 



25 
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compatible with the product or identifier attachment and 
tag detachment stages, as appropriate • Different linking 
groups may be employed for the product, so that a specific 
quantity of the product may be selectively released. In 
5 some instances the particle may have protected 
functionalities which may be partially or wholly 
deprotected prior to each stage, and in the latter case, 
reprotected. For example, amino may be protected with a 
carbobenzoxy group as in polypeptide synthesis, hydroxy 
10 with a benzyl ether, etc. 

Where detachment of the product is desired, there are 
numerous functionalities and reactants which may be used. 
Conveniently, ethers may be used, where substituted benzyl 
15 ether or derivatives thereof, e.g. benzhydryl ether, 
indanyl ether, etc. may be cleaved by acidic or mild 
reductive conditions. Alternatively, one may employ 
/3-elimination, where a mild base may serve to release the 
product. Acetals, including the thio analogs thereof, may 
20 be employed, where mild acid, particularly in the presence 
of a capturing carbonyl compound, may serve. By combining 
formaldehyde, HCl and an alcohol moiety, an a-chloroether 
is formed. This may then be coupled with an hydroxy 
functionality on the bead to form the acetal. Various 
25 photolabile linkages may be employed, such as 
o-nitrobenzyl, 7-nitroindanyl, 2-nitrobenzhydryl ethers or 
esters, etc. Esters and amides may serve as linkers, 
where half -acid esters or amides are formed, particularly 
with cyclic anhydrides, followed by reaction with hydroxyl 
30 or amino functionalities on the bead, using a coupling 
agent such as a carbodiimide. Peptides may be used as 
linkers, where the sequence is subject to enzymatic 
hydrolysis, particularly where the enzyme recognizes a 
specific sequence. Carbonates and carbamates may be 
35 prepared using carbonic acid derivatives, e.g. phosgene, 
carbonyl diimidazole, etc. and a mild base. The link may 
be cleaved using acid, base or a strong reductant, e.g.. 



wo 94/08051 PCr/US93/09345 

-23- 

LiAlH^, particularly for the carbonate esters. For a list 
of cleavable linkages, see, for example, Greene and Huts, 
Protective Groups in Organic Synthesis, 2nd ed. Wiley, 
1991. The versatility of the various systems that have 
5 been developed allows for broad variation in the 
conditions for attachment of products and identifiers and 
differential detachment of products and tags, as desired. 

The following table indicates various illustrative linking 
10 units f i.e. > in Formula I) and the manner in which they 
may be cleaved: 
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Table 1. Various illustrative linking units and the 
manner in which they may be cleaved. 



5 


1 

Linking Group 


Cleavage Reagent 




silyl 


fluoride or acid 




A 


hv 




B 


Ce(NHJ,(NOO^ 




-NCO^CL) * 


OH , IT, or LiAlH^ 


10 


C 


0^, Osoyio^', or KMnO^ 




D 


1) Op or Br,, MeOH 

2) H,0^ 




-Si-(L) 


oxidation, H*, Brj, Clj, 
etc. 




£ 


H^O* 




F 


H^O"" 


15 


G 


F" or H* 




H 


base , OH' 


20 


X = keto, ester, amide, 
NO,, sulfide, sulfoxide, 
sulfone, and related 
electron withdrawing 
groups 






I 


HjO* or reduction (e.g. 




J 


(0,P)^RhCl(H) 


25 


K 


Li,Mg, or BuLi 




Hg"2 




N 


Zn or Mg 


30 


X =s halogen or 
pseudohal ogen 






0 


oxidation (e.g. Pb(OAc), 
or H,IOJ 




P 


base 


35 


X = electron withdrawing 
group 





*(L) shows the point of attachment of the tag or product. 
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L is the tag or product either directly bonded to the 
indicated atom or indirectly bonded through a linking 
group such as C(0)0, which linking group may provide a 
convenient functionality. 
5 R is H or lower alkyl. 

Linker. 

The choice of linker for the ligand will be part of the 
synthetic strategy, since the linking group may result in 
10 a residual functionality on the product* It will usually 
be difficult, but feasible, to further modify the product 
after detachment from the bead. In designing the 
synthetic strategy, one can use a functionality 'to be 
retained in the product as the point of attachment for the 
15 linking group. Alternatively, when permitted by the 
nature of the product, one could use a cleavage or 
detachment method which removes the linking functionality, 
e.g., an arylthioether or silyl with a metal hydride or 
acid. Since in many cases the synthetic strategy will be 
20 able to include a functionalized site for linking, the 
functionality can be taken advantage of in choosing the 
linking group. In some instances it may be desirable to 
have different functionalities at the site of linking the 
product to the support, which may necessitate using 
25 different modes of linking, which modes must accommodate 
either the same detachment method or different detachment 
methods which may be carried out concurrently or 
consecutively, e.g., irradiation with light and acid 
hydrolysis. 

30 

Of particular interest for binding the identifiers to the 
particle are carbenes and nitrenes which can insert 
between a carbon and hydrogen atom to form a covalent 
bond, or into an olefinic bond to form a cyclopropane (in 
35 the case of carbene) or an aziridine (in the case of 
nitrene) . 
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With carbene or nitrene linking groups various substituted 
benzenes may be used, where the benzene is substituted 
with a group capable of providing a carbene: CHN^, COCHN^, 
SO2CHN2; or nitrene: N3, NO^, NO, ^^^N^. The carbenes may 
5 be generated from diazoalkane derivatives by photolysis, 
thermolysis, or by treatment with low valent transition 
metal species, e^g. , Rh(OAc)2. The nitrene may be 
generated by photolysis or thermolysis from azides; and 
from nitro, nitroso and azides by using tervalent 
10 phosphorus compounds or low valent transition metals. 

A group of linker moieties (F^-F^-) of particular interest 
include 2-nitro-4-carboxybenzyloxy , 2-nitro-4- 
diazoacetylbenzyloxy, 4 or 5 azidomethylcarbonyl-2- 
15 methoxyphenoxy, and 2-raethoxy-4, or 5-carboxyphenoxy 
moieties « 

Illustrative compounds where T represents the tag, Z 
represents a carbene or nitrene precursor or a carboxy 

20 group, and R is H or lower alkyl are as follows. For 
photochemical tag detachment (e.g., with ultraviolet light 
at about 350 nm) : T 3-Z-2-nitroben2yl ether, T 4-Z-2- 
nitrobenzyl ether, T 5-Z-2-nitrobenzyl ether, T 6-Z-2- 
nitrobenzyl ether, T 2-Z-4-nitroben2yl ether, T 3-Z-4- 

25 nitrobenzyl ether, T 3--Z-2-nitroben2yl carbonate, T 4-Z-2- 
nitrobenzyl carbonate, T 5-Z-2 -nitrobenzyl carbonate, T 6- 
Z -2 --nitrobenzyl carbonate, T 2 -2 -4 -nitrobenzyl carbonate, 
and T 3 -Z-4 -nitrobenzyl carbonate. For oxidative 

detachment (e.g., using eerie ammonium nitrate): l-OT-2- 

3 0 OR-3 -Z -benzene , l-OT-2 -OR-4 -Z -benz ene , 1 -OT-2 -OR-5-Z - 
benzene, l-OT-2-OR-6-Z-benzene, 1-OT-4-OR-2-Z -benzene, and 
l-OT-4-OR-3-Z-benzene. For reductive or alkylative 
detachment (e.g. with lithium/ ammonia or methyl iodide): 
T (2-Z-phenyl)thioether, T (3-Z-phenyl) thioether, and T 

35 (4-Z-phenyl) thioether. For desilylative detachment (e.g., 
using tetrabutyl ammonium fluoride or acid) : T dialkyl- 
(2-2-phenyl) silyl ether, T dialkyl- (3-Z-phenyl) silyl 
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ether, T dialkyl-(4-Z-'phenyl) silyl ether, T-dialkyl~(2-Z- 
phenyl)silane, T-<iialkyl-(3-z-phenyl)silane, and T- 
dialkyl- (4-Z-phenyl) silane. 

5 Coint)inatoiriaX Synthesis 

The synthesis will usually involve stages involving at 
least 2 choices, frequently at least 4 choices, and may 
involve 10 choices or more. Generally, the number of 
choices per stage will not exceed about 100, more usually 

10 not exceed about 50. The number of stages will usually be 
at least about 3, more usually at least about 4, 
frequently at least 5, and not more than about 30, more 
usually not more than about 25, preferably not more than 
about 20, more preferably not more than about 10, 

15 frequently not more than about 8, 

The nximber of choices and stages will usually result in at 
least a number of compounds which allows for a sufficient 
variety to provide a reasonable likelihood that at least 

20 one compound will have the characteristic of interest* 
The subject methodology allows for producing greater than 
25,000 compounds, usually greater than 50,000 compounds, 
preferably greater than 200,000 compounds, and a million 
or more may be produced. This will usually mean at least 

25 20 compounds but may be 10^ or more. 

In some syntheses, a stage may only involve one or two 
choices, but this situation will usually be limited in 
relation to the number of compounds one wishes to produce 

30 and the particular synthetic strategy. In many of the 
strategies, the restricted nximber of choices, i.e., fewer 
than 5 choices, more usually 2 or fewer choices, will be 
limited to the greater of 40% of the total number of 
stages or about 2 stages in the sequential synthesis, more 

35 usually limited to 20% of the total number of stages. 

Reaction Procedure. 
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In carrying out the synthesis, one may initially begin 
with a number of beads, usually at least 10^, more usually 
at least 10^, and desirably at least 10^, while generally 
not exceeding at least 10^^, more usually not exceeding at 
5 least 10^°. Depending upon the number of choices in the 
first stage, one will divide up the particles accordingly 
into as many containers. One can use microtiter well 
plates, individual containers, columns, gels, Terasaki 
plates, flasks, Merrifield synthesis vessels, etc. The 
10 particles will usually be divided up into groups of at 
least one particle each, usually a plurality of particles, 
generally 1000 or more, and may be 10^ or more depending on 
the total number of particles and choices involved in the 
stage . 

15 

One would then add the appropriate agents to each of the 
individual containers to process them in stages and add 
the identifiers which encode the reagent and stage. Each 
stage would provide the desired reaction. Once the 

20 reaction (s) is complete, one may wish to wash the beads 
free of any reagent, followed by combining all of the 
beads into a single mixture and then separating the beads 
according to the number of choices for the next stage. 
This procedure of dividing beads, followed by the tagging 

25 and synthesis stages (or vice versa) , and then recombining 
beads is iterated until the combinatorial synthesis is 
completed* 

In some instances, the same reaction may be carried out in 
30 2 or more containers to enhance the proportion of product 
having a particular reaction at a particular stage as 
compared to the other choices. In other instances, one or 
more of the stages may involve a portion of the beads 
being set aside and undergoing no reaction, so as to 
35 enhance the variability associated with the final product. 
In other situations, batches may be taken along different 
synthetic pathways. 
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In order to record or encode the synthesis history on the 
beads, in one einbodiement £ or cl or both may be present 
and subsequent attachment of C excludes the presence of 
C^at each stage one would tag the beads associated with 
5 each choice and stage with their own unique combination of 
identifiers. Alternately one may use a single tag to 
record or enclode ths synthesis history. Depending on the 
chemistries involved, this tagging may be done prior to, 
after, or concomitantly with the reactions which comprise 
10 each choice. Further, as a control, sample beads may be 
picked at any stage and a portion of their tags cleaved 
off and decoded to verify that the correct tags are bound 
to the sample beads. 

15 As indicated previously, in some instances, portions of 
the particles will be segregated into subsets, where each 
of the subsets would then undergo a different reaction 
series. At any time, the portions may be recombined into 
a single mixture for subsequent reaction. For example, if 

2 0 at one stage one introduces unsaturation, one could 
provide two subsets, where in one subset the unsaturation 
is reduced, while in the other sui^set the unsaturation is 
epoxidized. These two subsets could then be subjected to 
different reaction series. 

25 

After synthesis of the products is complete, they are 
screened for a desired property either after detachment of 
the ligand from the bead or while still attached. In the 
latter case, beads, for example, may be incubated in 

30 aqueous buffer with mouse monoclonal antibody Y. After 
incubation and washing, the beads are incubated with 
alkaline phosphatase-conjugated rabbit (or goat) 
polyclonal antibody directed against mouse antibodies. 
Using a fluorescent precipitation developing reagent, 

35 fluorescent beads with attached monoclonal antibody are 
identified and manually separated from the majority of 
clear, unstained beads. Alternatively, the fluorescent 
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beads can be separated using a fluorescence-activated cell 
sorter, so long as the tags are retained on the bead under 
the conditions of sorting. Each selected fluorescent bead 
is subjected to a means for releasing at least some of the 
5 tags from the bead. 

In instances where the synthesis does not involve the 
stagewise addition of like units, or where reaction 
byproducts are formed, there may be instances where there 

10 will be a plurality of compounds on a single bead or the 
structure of the active compound cannot be known from its 
reaction history. In accordance with the siibject 
invention, by knowing the reaction history, one may repeat 
the synthesis on a larger scale so as to obtain a 

15 sufficient amount of the product (s) to isolate the 
product (s) and structurally identify the active compound. 

The subject methodology may be illustrated using various 
reaction sequences. For example, barbiturates may be 

20 prepared by combining an aldehyde or ketone with an 
acetate ester to prepare a crotonate under Claisen 
conditions to provide an unsubstituted to tetrasubstituted 
crotonate. The crotonate may then be combined with a 
second acetate under Michael conditions, whereby a 

25 glutarate may be obtained having up to 6 substituents. 
The glutarate may then be combined with ammonia or 
monosubstituted amine to provide the barbiturate. By 
varying the aldehydes and ketones, the acetates and the 
amines, a great variety of barbiturates may be obtained. 

30 Where functionalities are present on one or more of the 
substituents, such as amino, carboxy, hydroxy, thiol, and 
the like, these groups may be protected or modified as 
desired. 

35 In another example described by Bunin and Ellman, J. Am. 

Chem. Soc. , 114, 10997 (1992), benzodiazepines are 
produced. One begins the synthesis with different amino 
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protected substituted 2-aminoben2ophenones bound to 
individual particles through, for example a 4'-oxy group* 
To each different group of particles in different vessels, 
after deprotection, are added a different Finoc-protected 
5 cr-amino acid, either naturally occurring or synthetic, 
under conditions where a peptide bond is formed. After 
deprotection, internal cyclization is caused, followed by 
alkylation on nitrogen with an alkylating agent. In only 
three stages, a very large number of benzodiazepines may 
10 be prepared and the libraries screened for tranquilizing 
or other activity. 

A wide variety of drug analogs may be produced, such as 

analogs of antihypertensive agents, e.g. enalapril; 
15 /?-blocJcers, e.g. propanolol; antiulcer drugs (Hg-receptor 

antagonists) e.g. cimetidine and ranitidine; antifungal 

agents (cholesterol-demethylase inhibitors) e.g. 

isoconazole; anxiolytics, e.g. diazepam; analgesics, e.g. 

aspirin, phenacetamide , and fentanyl; antibiotics, e.g. 
20 vancomycin, penicillin and cephalosporin; 

antiinflammatories, e.g. cortisone; contraceptives, e.g. 

progestins; abortifacients, e.g. RU-456; antihistamines, 

e.g. chlorphenamine; antitussives, e.g. codeine; 

sedatives, e.g. barbitol; etc. 

25 

An illustrative synthesis of cimetidine analogs could 
involve hydroxymethylsubstituted histidines, and related 
heterocycles, where the remaining carbon atoms or nitrogen 
atoms could be further substituted or unsubstituted, 
30 a, w-aminoalkyl thiols, and substituted thioamidine esters, 
where the groups on nitrogen could be varied, such as 
nitro, cyano, hydroxy, alkyl, combinations thereof, and 
the like. 

35 Identifier 

The identifiers of this invention may be represented by 
the Formula I: 
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pUp2.C-E-C' I 
Where F^-F^' is a linker which allows for attachment to a 
support and detachment of the tag from a support; and 

C-E-C'is the tag which is capable of detection and 
5 distinguishability; 

£ is a tag component which (a) allows for detection, 
such as an electrophoric group which can be analyzed by 
gas chromatography or mass spectroscopy or (b) allows for 
detection and for separation; 
10 C and C' are tag components which allow for 

individual distinguishing one tag from all other tags, 
usually allowing for separation as a result of variable 
length or substitution, for example, varying the 
chromatographic retention time or the mass spectroscopy 
15 ratio m/e; 

is a linking component capable of being selectively 
cleaved to release the tag component; and 

F^ is a functional group which provided for attachment 
to the support; or 
20 F^ is a bond when F^ is a cleavable group such as OH 

or carboxy. 

Although the identifiers of Formula I are typically added 
at each appropriate stage and choice during the 

25 combinatorial synthesis, the portion £ can be added at the 
end of the syntheses either before or after cleavage 
(preferably photochemically or oxidatively) from the 
substrate. Specifically, where C contains OH, NHR^, or SH, 
E can be attached to c prior to cleavage. Alternatively, 

30 if E is attached after cleavage, the point of attachment 
at c may be where F^ was attached. This is exemplified in 
the scheme on the following page: 
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20 

where S = substrate and 



n = 1-40 

Attachment of the identifier to the substrate can be 
represented as follows: 

25 F^-F^-C-E-C' + S > S-F^'-F^-C-E-C' 

where F^'-F^-C-E-C' represents the identifier residue 
attached to the substrate. For example, when the bead is 
functionalized with an aminomethyl group and F^ is CO^H, 
then F^' is -C(0)-? when the bead contains an unsaturated 
30 bond and F^ is NgCH-CCO)-, then F^' is =CH-C(0)- or 
-CHj-CCO)-. 

Of particular interest for use as identifiers are 
compounds of Formula I of the Foinaula la: 

F^-f2-(C(E-C') la 

35 
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Wherein: 

is COjH, CHjX, NR^r\ C(0)R\ OH, CHN2, SK, C(0)CHN2, 
S(02)C1, S(02)CHN2, N3, NO2, NO, S{02)N3, OC(0)X, C(0)X, 
NCO, or NCS; 
5 is ^ 

. ' —Si{R\k— , — Si(fi')— . 

CH2A — 

A — 

1 s 1 

— KCCO)0— . ^ (CR2)2 . c^pij — ^ 




— SKCHjS— (CRj^— A— — R^(CR2?2A— ^ 
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— S — C(R*)2 A— - — C(X)R^^ — C(R )2 • 



— G(OH)R^ C[R\k—. — C(OH)R— C(CH2X)S— , 

— C(0H)R^C(R*)2— C(X)K— . — C(0H)(CH2CH2X) — . 
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A is -O, -0C(0)0-, -OC(0)-, or -NHC(O)-; 
C is a bond, C^-Cgo alkylene optionally substituted by 1-40 
F, CI, Br, q-Cg alkoxy, NR^rS OR*, or NRS or 
-C(C{R*)2)„-Y-Z-Y-(C(R*)2)„Y-Z-Y]p-; with the proviso 
5 that the naximtun number of carbon atoms in C+C' is 

preferably 20; 

Ci is H; F; cl; C^-Cjq alkylene optionally substituted by 
1-40 F, Cl, Br, C^-C^ alkoxy, NR^R^^, OR^^, or NR*, or 
- [ (C (R*) 2) „-Y-Z-Y- ( C (R*) 2) „Y-Z-Y ] ; 
10 E is C^-C^p alkyl substituted by 1-20 F, Cl or Br; or 

Q-aryl wherein the aryl is substituted by 1-7 F, Cl, 
NO2, S02R^, or substituted phenyl wherein the 
substituent is 1-5 F, Cl, N02r or S02R^; 

R^ is H or C^-C^ alkyl; 
15 R^ is C=0, C(0)0, C(0)NR\ S, SO, or SO2; 

R* is H or C^-C^ alkyl; 

r5 is C^-C^ alkyl; 

a is 1-5; 

b is 1-3; 
20 m and n is each 0-20; 

p is 1-7; 

Q is a bond, O, S, NR*, C=0, -C(0)NR^, -NR^C(O)-, -C(0)0-, 
or -OC(0)-; 

X is a leaving group such as Br, Cl, triflate, mesylate, 
25 tosylate, or OC{0)OR^; 

Y is a bond, O, S, or NR*; 

Z is a bond; phenyl optionally substituted by 1-4 F, Cl, 
Br, C,-C^ alkyl, C^-C^ alkoxy, C^-C^ alkyl substituted 
by 1-13 F, Cl, or C^-C^ alkyloxy substituted by 1-13 

30 F, Cl, or Br; (C(R*) 2) ^.jo' ^ ^^2^1-20' ^^^^ proviso 

that when Z is a bond one of its adjacent Y's is also 
a bond; and 

aryl is a mono- or bi-cyclic aromatic ring containing up 
to 10 carbon atoms and up to 2 heteroatoms selected 
35 from O, S, and N. 

In the definitions of F^ in Formula la, the left-hand 
bond as depicted attaches to F^, 



wo 94/08051 



PCr/US93/09345 



-39- 

Also useful as identifiers are compounds of the 
Formula la' ; 

F^-(C(E-C').)j, la' 

wherein: 
5 is OH or COOH; and 

the remaining definitions are as in Formula la. 
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Pref erred compounds of Formula la are those wherein; 
is 

2 . CO2H. OH. CHN2. C(0)CHN2. C(0)X. KCS. or CHoX: 



5 



10 



15 




C and C' is each independently C^-Cjq alkylene unsubstituted 
25 or substituted by 1-40 F or CI, or (O- (^2)3.3]^; 

E is C^-C^Q alkyl substituted by 1-20 F or CI; Q-aryl where 
aryl is a bi-cyclic aromatic ring substituted by 1-7 
F or Cl; or Q-phenyl substituted by 1-5 F, CI, NO2, or 
S02R^; and 

30 Q is a bond, 0, -NR^CCO)-, or -0C(0)-, 

Preferred compounds of Formula la are those wherein 
-C(E-C')3 is represented by -(CH2)3.,5-(CF2)^.i5F, 
- (CH2) 3.,5- (CCI2) ^.^^ci , - (CH2CH2-O) 1.5-Ar , 

35 -(CH2CH2CH20)^.5-Ar, or - {CH2) ^.^2'^""-^^' 

wherein Ar is pentafluoro- pentachloro-, or 
pentabromophenyl , 2,2,5, 6-tetraf luoro-4 (2,3,4,5,6- 
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pentaf luorophenyl ) phenyl , 2,4, 6-trichlorophenyl , 
2, A, 5-trichlorophenyl , 2, 6-dichloro-4-f luorophenyl , 
or 2,3,5, 6-tetraf luorophenyl . 
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wherein Ar is pentafluoro- pentachloro-, or 
pentabromophenyl , 2,3,5, 6-tetraf luoro-4 (2,3,4,5,6- 
pentaf luoropheny 1 ) phenyl , 2,4, 6-trichlorophenyl , 
2,4, 5-trichlorophenyl , 2 , 6-dichloro-4 -f luorophenyl , 
5 or 2 , 3 , 5 , 6-tetraf luorophenyl . 

Other preferred compounds of Formula la are those 
wherein E-C' is H, OH, or NHj. Such compounds are 
particularly useful for reaction with an E at the end of 
the combinatorial synthesis, especially with an E 
10 detectable by fluorescence or electron capture, such as 
dansyl chloride or polyhalobenzoylhalide. 

The compounds of Formula I can be prepared according to 
the following exemplary schemes or other means known to 
15 those skilled in the art. 



20 



SCHEME 1 

Identifier Tag Preparation 



25 




lEq.CSjCOs, DMF 
90°c, 2 hi 



ZLECTBOPHOSIC PEEIOL, AcOS 

X = CI OR r. 
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8CHEME 2 

Identifiers With Photolvtic Cleavage Linkers 



5 




0 
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8CBEME 3 

Identifiers wi<:h Oxidative Cleavage Linkers 




one 



FPiig. Jim. 

TOLUEBE 



HeO 



one 
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8CEEME 4 

Identifiers With Alternative Oxidative Release Linkers 




OMe 



PFI13. DEAD, mm 




OMe 
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8CHEKE 5 
E-C^ Tags 



5 



10 




SCHEME 6 

15 Identifiers With Photolvtic Cleavag e Linkers 



1. COCLj. TOLUENE 




0 r 



wo 94/08051 



-48- 



PCT/US93/09345 



SCHEME 7 

Identifiers With Oxidative Cleavage Linkers 

5 
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The identifier may comprise one or a plurality of 
identical tags. The identifiers will be individual 
chemical compound (s) which may be distinguished one from 
the other and will uniquely identify different choices and 
5 stages. In this manner, very large combinatorial 
libraries may be prepared with a relatively small number 
of identifiers, usually fewer than 50 tags. 
During each stage, a combination of identifiers will be 
added, which defines the stage and choice. Each 

10 identifier will be bound, either . covalently or non- 
covalently to the bead or to the product, usually the 
bead. Combinations of identifiers are used to provide a 
binary or other code at each stage, whereby the choice and 
stage may be defined. The combination of identifiers may 

15 include zero or only one identifier. 

Tags 

So far as the tags (C-E-C') are concerned, the tags which 
20 are employed will be characterized as follows: by being 
removable from the bead by means depending on F^, 
preferably by photolysis or oxidation; by being 
individually dif ferentiable, usually separable; by being 
stable under the synthetic conditions; by encoding both 
25 stage and choice so as to uniquely define the choice of 
agent used at each stage in the synthesis; desirably, 
there should be an easy way to identify the various tags 
with readily-available equipment which does not require 
sophisticated technical capabilities to operate; they 
30 should be relatively economical and provide a strong 
signal based on a relatively few molecules; and the tags 
should provide sufficient sensitivity to permit 
distinguishing the tags from the other components which 
may be present during the tag determinations. 

35 

The tags may be structurally related or unrelated, as in 
a homologous series, repetitive functional groups, related 
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members of the Periodic Chart, different isotopes, 
combinations thereof, or the like. The tags may be used 
as elements of a binary code, so that one tag can define 
two choices, two tags can define four choices, three tags 
5 can define eight choices, five tags can define thirty-two 
choices, etc. Thus, at each stage of the synthesis, a 
relatively small number of tags can designate a much 
larger number of choices. The tags comprising the 
identifiers for each stage may or may not be related to 
10 other stages. Each tag for any combinatorial synthesis 
must allow for being distinguished from all other tags. 
In this manner, very large combinatorial libraries may be 
prepared with a relatively small number of tags, usually 
fewer than 60 tags, more usually fewer than about 50 tags. 

15 

For each bead, there will usually be at least 0.01 
femtomol, more usually 0.001-50 pmol, of each tag, 
although lesser or greater amounts may be used in special 
circumstances. The amount of product may also be at least 

20 in the same range and up to at least 10^ or more greater, 
usually being at least 0.01 pmol, more usually at least 
1.0 pmol and generally not more than about 10 nmol. 
Depending upon the number of beads, the number of stages 
and the number of choices per stage, the nximber of 

25 products produced will usually exceed 10^, more usually 
10^, and may exceed 10'°, usually not exceeding about 10®, 
preferably being in the range of about 10^ to 10®, more 
usually 10^ to 10®. 

30 The tags will, for the most part, be organic molecules. 
Each tag will usually have fewer than about 100 atoms, 
more usually fewer than about 80 atoms, generally fewer 
than about 60 atoms, other than hydrogen, excluding a 
linking moiety which would not be retained on release of 

35 the tag from the bead. The linking moiety may be of any 
size, usually being fewer than about 30 atoms, more 
usually fewer than 20 atoms, other than hydrogen. The 
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size of the linking moiety is not critical, but one of 
convenience. The tags ©ay form families of compounds, 
where all of the compounds are of a similar nature or may 
be combinations of different families, where the compounds 
5 may be aliphatic, alicyclic, aromatic, heterocyclic, or 
combinations thereof. Distinguishing features may be the 
number of repetitive units, such as methylene groups in an 
alkyl moiety, allcyleneoxy groups in a polyalkyleneoxy 
moiety, halo groups in a polyhalocompound, a- and/or 
10 substituted ethylenes, where the substituents may involve 
alkyl groups, oxy, carboxy, amino, halo, or the like; 
isotopes; etc* 

Tag Analysis 

15 

Tags may be removed from the bead using reductive, 
oxidative, thermolytic, hydrolytic, or photolytic 
conditions depending on the nature of the group F^; for 
example, by oxidation of a catechol ether with eerie 
20 ammonium nitrate or by photolysis of a nitrobenzyl ether 
or ester or amide, or by other methods, e.g. as shown in 
Table 1. 

Differentiation of tags can be achieved with physical 
25 differences, e.g. molecular weight of the tags or the 
chromatographic retention time using gas or liquid 
chromatography. Positional isomers may have different 
retention time. If positional isomers or steroisomers are 
inadequate for physical separation, then one could use 
30 varying numbers of substituents, e.g. halogens, such as 
fluorines, methyl groups, oxy groups, or other side chains 
in conjunction with differing numbers of units, e.g. 
methylene groups or ethyl eneoxy groups, to provide the 
desired separation. Ratios of radioisotopes could be 
35 used, where the radioisotopes provide for differential 
emission, for example ^"^C and ^H. The physical differences. 
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particularly mass number, can provide information about 
choice and stage. 

Instead of ^*C/^H ratios, one could use combinations of non- 
5 radioactive isotopes, e.g. -CHJ)^, where m is 0 and up to 
3 and n is 3 minus m. For example, by detecting the 
varying amounts of up to four different methyl groups 
using mass spectroscopy, one could define a large number 
of choices. 

10 

When E is a bond and C' is H, the tags obtained upon 
release from the support have an active functionality for 
reaction with a labeling reagent which introduces a 
detectable tag component E. Conveniently, the 

15 functionality could be a double bond, particularly an 
activated double bond^ hydroxy, thio, amino, carboxy, etc. 
The tag would then be reacted with an excess of the 
labeling reagent to provide the product (E-C) for 
analysis. In this way a wide variety of labeling reagents 

20 could be used as part of the identifying system, which may 
not be compatible with the synthetic strategy for the 
product of interest. Labeling reagents which may be used 
for detection include haloaromatics (e.g., perf luorobenzyl 
bromide), fluorescers (e.g., dansyl chloride), 

25 radioisotopes, chemiluminescers , etc. 

While exemplary tags and reactions have been given, it 
should be understood that many other combinations could be 
employed. 

30 

Depending on the chemical and physical nature of the tags, 
an appropriate method for separation is chosen, desirably 
one of various chromatographic procedures including gas ^ 
chromatography (GO) , liquid chromatography (LC) 
35 particularly high-performance liquid chromatography 
(HPLC) , thin layer chromatography (TLC) , electrophoresis, 
etc. Instead of chromatographic procedure, mass 
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spectrometry may be employed for separation by mass 
number > Tags include: 

for GC: chemically inert organic molecules having 
different molecular weights including alkanes, alkenes, 
5 arehes, halocarbons, ethers, alcohols, silanes, 
thioethers, etc., particularly halogenated compounds, with 
or without other functionalities, for electron capture 
detection or mass spectroscopy detection (MS) with 
capillary GC separation, and for compound with elements 

10 not normally found in organic chemistry (e.g., Sn, Ge) for 
atom emission detection with GC capillary seperation; 
for LC, HPLC or TLC: see above for GC, conveniently 
linear ethers or hydrocarbons with substitution by 
radioisotopes or combinations of radioisotopes for 

15 radioassay detection or suitable groups for fluorescence 
detection after separation; 

for electrophoresis: see above, particularly 

functionalized charged molecules, e.g. cat ionic or 
anionic, particularly organic or inorganic acid groups, 
20 where the molecule may be further modified by having a 
detectable radioisotope or fluorescer for detection in the 
electrophoresis; 

for mass spectroscopy: see above, particularly different 
mass numbers due to different isotopes, different numbers 
25 of the same functionality or different functionalities, 
different members of a homologous series or combinations 
thereof. 

The separation of tags from one another may involve 
30 individual techniques or combinations of techniques, e.g. 
chromatography and electrophoresis; gas chromatography and 
mass spectroscopy; etc. 

The tags of the present invention will have a property 
35 which allows detection at very low levels, usually not 
greater than nanomol, preferably picomol or less, more 
preferably femtomol or less, in the presence of other 
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compounds which may be present at significantly higher 
levels. For this reason r specific atomic substitutions 
may be used to render the labels easily detectable. Such 
substitutions include: 
5 (a) substitution by electronegative elements, e.g. 
fluorine or chlorine, for electron capture detection in 
conjunction with capillary GC or negative ion mass 
spectroscopy detection ; 

(b) substitution by an uncommon element (excluding C, 

10 ' and O) for atomic emission detection in conjunction with 
capillary GC; 

(c) substitution by several uncommon elements for atomic 
emission detection to determine the ratio between the 
elements; 

15 (d) substitution by a radioactive element, e.g. ^H, for 
detection by autoradiography or scintillation counting in 
conjunction with LC, TLC or electrophoresis; 
(e) substitution by a multiplicity of radioactive elements 
having differing emissions, e.g. "^H and ^^c, for detection 

20 by autoradiography or scintillation counting to determine 
the ratio of the different radioactive elements. 

For single-element substitution (a., b. , d. above) a 
separable mixture of A tags whose simple presence or 

25 absence can be detected would encode up to 2^ different 
syntheses. For multiple-element substitution (see, c. and 
e. above) a separable mixture of A tags each having B 
distinguishable states (e.g., different ^H/^^c ratios, 
different Si/Sn ratios) would be able to encode for up to 

30 B* different syntheses. 

A wide variety of isotopes exist, where the presence or 
ratio of isotopes may provide information as to stage and 
choice. The isotopes may be radioactive or non- 
35 radioactive. Isotopes of particular interest include 
deuterium, tritium, ^^c, ^^P, ^^^I, etc. 
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By employing mixtures of isotopically-modif ied compounds, 
one can greatly expand the information obtained from a 
single tag compound which is only distinguished by the 
presence of isotopes. For example, one could prepare a 
5 mixture of ratios of hydrogen to deuterium, where the 
various ratios could differ by as little as 10% each. 
By replacing hydrogens with another atom, such as 
fluorine, one would then have a varying mixture of 
hydrogens, deuteriums and fluorines, providing for a large 
10 number of different dif ferentiable tags. 

Other groups that may be involved could be aromatic rings, 
which are differentially substituted, as to position and 
functionality. Thus, by having substituted benzene rings, 

15 where the position of the substitution and the nature of 
the substitution can be determined, one can provide for a 
plurality of molecules which can be distinguished and can 
provide for both stage and choice information. For 
example, if C were constant one could detect and 

20 discriminate through the substitution pattern on E when E 
is a polyhalogenated aromatic ring* 

There is also the possibility to use fluorescent tags. 
While fluorescent tags alone may not be sufficient to 
25 define a significant number of stages with a significant 
niimber of choices, as referred to above, by providing for 
means for separating the fluorescent tagging molecules 
based on variations in c or C' , one can individually 
detect the tags by their fluorescence. 

30 

The mixture of tags associated with a particular bead may 
be detached and subject to an initial separation, where it 
is desirable to detect each of the tags separately. Once 
the group of tags has been separated, each of the tags may 
35 then be analyzed based on its particular functionalities 
and distinctive properties. Various techniques which may 
be used to detect the particular tags include 
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autoradiography or scintillation counting, electron 
capture detection, negative or positive ion mass 
spectroscopy, infrared spectroscopy, ultraviolet 
spectroscopy, electron spin resonance spectroscopy, 
5 fluorescence, and the like. 

Assays 

To determine the characteristic of interest of the 
product, a vide variety of assays and techniques may be 
10 employed « 

Frequently, in screening the beads, one will use either 
single beads or mixtures of beads and determine whether 
the bead or mixtures show activity. Thus, the mixtures 
15 may involve 10, 100, 1000 or more beads. In this way, 
large groups of compounds may be rapidly screened and 
segregated into smaller groups of compounds. 

One technique is where one is interested in binding to a 

20 particular biomolecule such as a receptor. The receptor 
may be a single molecule, a molecule associated with a 
microsome or cell, or the like. Where agonist activity is 
of interest, one may wish to use an intact organism or 
cell, where the response to the binding of the subject 

25 product may be measured* In some instances, it may be 
desirable to detach the product from the bead, 
particularly where physiological activity by transduction 
of a signal is of interest. Various devices are available 
for detecting cellular response, such as a 

30 microphysiometer, available from Molecular Devices, 
Redwood City, CA. Where binding is of interest, one may 
use a labeled receptor, where the label is a fluorescer, 
enzyme, radioisotope, or the like, where one can detect 
the binding of the receptor to the bead. Alternatively, 

35 one may provide for an antibody to the receptor, where the 
antibody is labeled, which may allow for amplification of 
the signal and avoid changing the receptor of interest. 



wo 94/08051 



PCT/US93/0934S 



which might affect its binding tot he product of interest. 
Binding may also be determined by displacement of a ligand 
bound to the receptor, where the ligand is labeled with a 
detectable label. 

5 

In some instances, one may be able to carry out a 
two-stage screen, whereby one first uses binding as an 
initial screen, followed by biological activity with a 
viable cell in a second screen. By employing recombinant 
10 techniques, one can greatly vary the genetic capability of 
cells. One can then produce exogenous genes or exogenous 
transcriptional regulatory sequences, so that binding to 
a surface membrane protein will result in an observcible 
signal, e.g. an intracellular signal. For example, one 
15 may introduce a leuco dye into the cell, where an enzyme 
which transforms the leuco dye to a colored product, 
particularly a fluorescent product, becomes expressed upon 
appropriate binding to a surface membrane, e.g. 
/3-galactosidase and digalactosidyl fluorescein. In this 
20 manner, by associating a particular cell or cells with a 
particular particle, the fluorescent nature of the cell 
may be determined using a FACS, so that particles carrying 
active compounds may be identified. Various techniques 
may be employed to ensure that the particle remains bound 
25 to the cell, even where the product is released from the 
particle. For example, one may use antibodies on the 
particle to a surface membrane protein, one may link 
avidin to the surface of the cell and have biotin present 
on the particle, etc. 

30 

Assays may be performed stagewise using individual 
particles or groups of particles or combinations thereof. 
For example, after carrying out the combinatorial 
syntheses, groups of about 50 to 10,000 particles may be 
35 segregated in separate vessels. In each vessel, as to 
each particle a portion of the product bound to the 
particle is released. The fractional release may be as a 
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result of differential linking of the product to the 
particle or using a limited amount of a reagent, condition 
or the like, so that the average number of product 
molecules released per particle is less than the total 
5 nianber of product molecules per particle. One would then 
have a mixture of products in a small volume. The mixture 
could then be used in an assay for binding, where the 
binding event could be inhibition of a known binding 
ligand binding to a receptor, activation or inhibition of 
10 a metabolic process of a cell, or the like. Various assay 
conditions may be used for the detection of binding 
activity as will be described subsequently. Once a group 
is shown to be active, the individual particles may then 
be screened, by the same or a different assay* One could 
15 of course, have a three- or four-stage procedure, where 
large groups are divided up into smaller groups, etc. and 
finally single particles are screened. in each case, 
portions of the products on the particles would be 
released and the resulting mixture used in an appropriate 
20 assay. The assays could be the same or different, the 
more sophisticated and time consuming assays being used in 
the later or last stage. 

One may also provide for spatial arrays, where the 
25 particles may be distributed over a honeycomb plate, with 
each well in the honeycomb having 0 or 1 particle. 

The subject methodology may be used to find chemicals with 
catalytic properties, such as hydrolytic activity, e.g. 

30 esterase activity. For this purpose one might embed beads 
in a semisolid matrix surrounded by diffusible test 
substrates. If the catalytic activity can be detected 
locally by processes that do not disturb the matrix, for 
example, by changes in the absorption of light or by 

35 detection of fluorescence due to a cleaved substrate, the 
beads in the zone of catalytic activity can be isolated 
and their labels decoded. 
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Instead of catalytic activity, compounds with inhibitory 
or activating activity can be developed • Cpmpounds may be 
sought that inhibit or activate an enzyme or blocX a 
binding reaction. To detect beads that inhibit an enzyme, 
5 which beads have an attached product with this desirable 
property, it is advantageous to be able to release the 
products from the beads, enabling them to diffuse into a 
semisolid matrix or onto a filter where this inhibition, 
activation or blocking can be observed. The beads that 

10 form a visualized or othervise detectable zone of 
inhibition, activation or blocking can then be picked and 
the tags decoded. In this case it is necessary that a 
portion of the synthesized products be attached to the 
beads by cleavable linkages, preferably a photolabile 

15 linkage, while a portion of the tags remain attached to 
the bead, releasable after picking by a different means 
than before. 

A dialysis membrane may be employed where a layer of beads 
20 is separated from a layer of radiolabeled ligand/ receptor 
pair. The bead layer could be irradiated with ultraviolet 
light and the product released from the bead would diffuse 
to the pair layer, where the radiolabeled ligand would be 
released in proportion to the affinity of the compound for 
25 the receptor. The radiolabeled ligand would diffuse back 
to the layer of beads. Since the radiolabel would be 
proximal to the bead, beads associated with radioemission 
would be analyzed. 

30 Of particular interest is finding products that have 
biological activity. In some applications it is desirable 
to find a product that has an effect on living cells, such 
as inhibition of microbial growth, inhibition of viral 
growth, inhibition of gene expression or activation of 

35 gene expression. Screening of the compounds on the beads 
can be readily achieved, for example, by embedding the 
beads in a semisolid medium and the library of product 
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molecules released from the beads (while the beads are 
retained) enabling the compounds to diffuse into the 
surrounding medium. The effects, such as plaques with a 
bacterial lawn, can be observed. Zones of growth 
5 inhibition or growth activation or effects on gene 
expression can then be visualized and the beads at the 
center of the zone picked and analyzed. 

One assay scheme will involve gels where the molecule or 
10 system, e*g. cell, to be acted upon may be embedded 
substantially homogeneously in the gel. Various gelling 
agents may be used such as polyacrylamide, agarose, 
gelatin, etc. The particles may then be spread over the 
gel so as to have sufficient separation between the 
15 particles to allow for individual detection. If the 
desired product is to have hydrolytic activity, a 
substrate is present in the gel which would provide a 
fluorescent product. One would then screen the gel for 
fluorescence and mechanically select the particles 
20 associated with the fluorescent signal. 

One could have cells embedded in the gel, in effect 
creating a cellular lawn. The particles would be spread 
out as indicated above. Of course, one could place a grid 

25 over the gel defining areas of one or no particle. If 
cytotoxicity were the criterion, one could release the 
product, incubate for a sufficient time, followed by 
spreading a vital dye over the gel. Those cells which 
absorbed the dye or did not absorb the dye could then be 

30 distinguished. 

As indicated above, cells can be genetically engineered so 
as to indicate when a signal has been transduced. There 
are many receptors for which the genes are known whose 
35 expression is activated. By inserting an exogenous gene 
into a site where the gene is under the transcriptional 
control of the promoter responsive to such receptor, an 
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enzyme can be produced which provides a detectable signal, 
e.g. a fluorescent signal. The particle associated with 
the fluorescent cell(s) may then be analyzed for its 
reaction history. 

5 

Libraries and Kits 

For convenience, libraries and/or kits may be provided. 
The libraries would comprise the particles to which a 
library of products and tags have been added so as to 

10 allow for screening of the products bound to the bead or 
the libraries would comprise the products removed from 
the bead and grouped singly or in a set of 10 to 100 to 
1000 members for screening. The kits would provide 
various reagents for use as tags in carrying out the 

15 library syntheses. The kits will usually have at least 4, 
usually 5, different compounds in separate containers, 
more usually at least 10, and may comprise at least 10^ 
different separated organic compounds, usually not more 
than about 10^, more usually not more than about 36 

20 different compounds. For binary determinations, the mode 
of detection will usually be common to the compounds 
associated with the analysis, so that there may be a 
common chromophore, a common atom for detection, etc. 
Where each of the identifiers is pre-prepared , each will 

25 be characterized by having a distinguishable composition 
encoding choice and stage which can be determined by a 
physical measurement and including groups or all of the 
compounds sharing at least one common functionality. 

30 Alternatively, the kit can provide reactants which can be 
combined to provide the various identifiers. In this 
situation, the kit will comprise a plurality of separated 
first functional, frequently bifunctional, organic 
compounds, usually four or more, generally one for each 

35 stage of the synthesis, where the functional organic 
compounds share the same functionality and are 
distinguishable as to at least one determinable 
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characteristic- In addition, one would have at least one, 
usually at least two, second organic compounds capable of 
reacting with a functionality of the functional organic 
compounds and capable of forming mixtures which are 
5 distinguishable as to the amount of each of said second 
organic compounds. For example, one could have a glycol, 
amino acid, or a glycolic acid, where the various 
bifunctional compounds are distinguished by the number of 
fluorine or chlorine atoms present, to define stage, and 

10 have an iodomethane, where one iodomethane has no 
radioisotope, another has ^^C and another has one or more 
^H. By using two or more of the iodomethanes , one could 
provide a variety of mixtures which could be determined by 
their radioemissions. Alternatively, one could have a 

15 plurality of second organic compounds, which could be used 
in a binary code. 

As indicated previously one could react the tags after 
release with a molecule which allows for detection. In 

20 this way the tags could be quite simple, having the same 
functionality for linking to the particle as to the 
detectable moiety. For example, by being linked to a 
hydroxycarboxyl group, an hydroxy 1 group would be 
released, which could then be esterified or etherified 

25 with the molecule which allows for detection. For 
example, by using combinations of fluoro- and chloroalkyl 
groups, in the binary mode, the number of fluoro and/or 
chloro groups could determine choice, while the number of 
carbon atoms would indicate stage. 

30 

Groups of compounds of particular interest include linkers 
joined to a substituted ortho-nitrobenzyloxy group, 
indanyloxy or fluorenyloxy group, or other group which 
allows for photolytic or other selective cleavage. The 
35 linking group may be an alkylene group of from 2 to 20 
carbon atoms, polyalkyleneoxy, particularly alkyleneoxy of 
from 2 to 3 carbon atoms, cycloalkyl group of from 4 to 8 
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carbon atoms, haloalkyl group, particularly fluoroalkyl of 
from 2 to 20 carbon atoms, one or more aromatic rings and 
the like, where the linker provides for the discrimination 
between the various groups, by having different numbers of 
5 units and/or stibstituents. 

Individual particles or a plurality of particles could be 
provided as articles of commerce, particularly where the 
particle (s) have shown a characteristic of interest • 

10 Based on the associated tags, the reaction history may be 
decoded. The product may then be produced in a large 
synthesis* Where the reaction history unequivocally 
defines the structure, the SEUue or analogous reaction 
series may be used to produce the product in a large 

15 batch. Where the reaction histoiry does not unambiguously 
define the structure, one would repeat the reaction 
history in a large batch and use the resulting product for 
structural analysis- In some instances it may be found 
that the reaction series of th2 combinatorial chemistry 

20 may not be the preferred way to produce the product in 
large amounts. 

Thus, an embodiment of this invention is a kit comprising 
a plurality of separated organic compounds, each of the 

25 compounds characterized by having a distinguishable 
composition, encoding at least one bit of different 
information which can be determined by a physical 
measurement, and sharing at least one common 
functionality. A preferred embodiment is a kit comprising 

30 at least 4 different functional organic compounds. 

More preferred is a kit wherein said functional organic 
compounds are of the formula: 

pi^F^-e-E-C' I 
35 where F^-F^ is a linker which allows for attachment to and 
detachment from a solid particle; and 
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C-E-C' is a tag member which can be determined by a 
physical measurement, especially wherein said functional 
organic compounds differ by the number of methylene groups 
and/ or halogens, nitrogens or sulfurs present. 

5 

Also preferred is a kit wherein the £-E-C' portion is 
removed photochemically or a kit wherein the C-E-C' 
portion is removed oxidatively, hydrolytically , 
thermolytically, or reductively, 

10 

Compounds of this invention may be useful as analgesics 
and/or for the treatment of inflammatory disease, 
especially in the case of the azotricyclics acting as 
antagonists of the meurokin 1/brandykin receptor. Members 
15 of the benzodiazepine library may be useful as a muscle 
relaxant and/or tranquilizer and/or as a sedative. 
Members of the 23 million Mixed Amide Library may be of 
use in the treatment of hypertension on endothelin 
antagonists or Raynaud's syndrome. 

20 

The following examples are offered by way of illustration 
and not by way limitation. 

In one embodiment the invention is composition comprising 
25 at least 6 different components, each component having a 
distinguishable moiety. The components may be 

characterized by each moiety being substantially 
chemically stable or inert and having an identifiable 
characteristic different from each of the other moieties. 
30 Each moiety is joined to a linking group having an active 
functionality capable of forming a covalent bond, through 
a linking group to individually separable solid surfaces, 
or joined to a group which is detectable at less than 1 
nanomole. With a proviso that when the moieties are 
35 joined to the linking group, the components are physically 
segregated. Preferably, the solid supports are beads. 
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In one embodiment each component comprises molecules of 
different compounds bound to individual separable solid 
surfaces, wherein the molecules on the solid surfaces. 
Preferably, the moieties of the invention define an 
5 homologous series and/or a series of substitutions on a 
core molecule. 



The invention herein is also directed to a compound 
library comprising at least one hundred unique solid 

10 supports. In this compound library each solid support has 
(1) an individual compound bound to the solid support as 
a major compound bound to the support; and (2) a plurality 
of tags e.g. tags incapable of being sequenced, where the 
tags are individual tag molecules which are physically 

15 distinguishable in being physically separable and are 
substituted so as to be detectable at less than about a 
nanomole or have a functional group for bonding to a 
substituent which is detectable at less than about at 
nanomole. Preferably, in the compound library each solid 

20 support has at least about 6 tags. In another embodiment, 
in the compound library the tags define a binary code 
encoding the synthetic protocol used for the synthesizing 
the compound on the solid support. 



25 This invention also provides a method for determining a 
synthetic protocol encoded by separable physically 
different tags in a series and defining a binary code. In 
this method at least two tags are employed to define each 
stage of the synthetic protocol, there being at least six 

30 tags. The step of the method comprising separating tags 
by means of their physical differences and detecting the 
tags to define a binary line encoding the protocol whereby 
the synthetic protocol is determined in accordance with 
the binary line. 

35 

Compound of this invention may be useful as analgesics 
and/or for the treatment of inflammatory disease. 
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especially in the case of the azatricyclics acting as 
antagonists of the neurokinin l/brandykin receptor* 
Members of the benzodiazepine library may be useful as a 
muscle relaxant and/or tranquilizer and/ or as a sedative. 
5 Members of the 23 Mixed Amide Library may be of use in the 
treatment of hypertension on endothelin antagonists or 
Raynaud's syndrome. 

EXAMPLE 1 

10 PEPTIDE LIBRARY 

In order to encode up to 10^ different syntheses, one could 
prepare 30 different identifiers which carry individual 
tags capable of being separated one from another by 
capillary GC* For encoding a smaller number of syntheses^ 
15 fewer identifiers would be used. The tags would normally 
be prepared from commercially-available chemicals as 
evidenced by the following illustration. 
w-Hydroxyalkenes-l, where the number of methylene groups 
would vary from 1 to 5, wculd be reacted with an 
20 iodoperf luoroalkane, where the number of CFj groups would 
be 3, A, 6, 8, 10, and 12. By employing a free-radical 
catalyst, the iodoperfluorocarbon would add to the double 
bond, where the iodo group could then be reduced with 
hydrogen and a catalyst or a tin hydride. In this manner, 
25 30 different tags could be prepared. The chemical 
procedure is described by Haszeldine and Steele, J. Chem. 
Soc. (London), 1199 (1953); Brace, J. Fluor. Chem., 20, 
313 (1982). The highly fluorinated tags can be easily 
detected by electron capture, have different GC retention 
30 times, so that they are readily separated by capillary GC, 
are chemically inert due to their fluorinated, hydrocarbon 
structure and each bears a single hydroxyl functional 
group for direct or indirect attachment to particles. 

35 Before attachment to compound precursors, the tags 
(referred to as T1-T30) would be activated in a way which 
is appropriate for the chemical intermediates to be used 
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in the coit±)inatorial synthesis. By appropriate it is 
intended that a functionality would be added which allows 
for ready attachment by a chemical bond to a compound 
precursor or to the bead matrix itself • The activation 
5 process would be applied to each of the 30 different tags 
and allow these tags to be chemically bound, either 
directly or indirectly, to intermediates in the 
combinatorial compound synthesis • For example, a carboxy 
derivative could be used for coupling and upon activation 
10 the resulting carboxy group would bond to the particle. 

In the case of a combinatorial synthesis of a peptidic 
compound or other structure made of amide-linked organic 
fragments, the encoding process could consist of addition 
15 of a carboxylic acid-equipped linker. For example, the 
tag would be coupled to the tert . -butyl ester of o-nitro- 
fi-carboxybenzyl bromide in the presence of sodium hydride. 
The ester would then be hydrolyzed in dilute 
trifluoroacetic acid. 

20 

Activated identifiers would be coupled to intermediates at 
each stage in the combinatorial compound synthesis. The 
ortho-nitrobenzyl ether part of the activated identifiers 
is used to allow photochemical detachment of the tags 
25 after completing the combinatorial synthesis and selecting 
the most desirsUble compounds. The detached tags would 
then be decoded using capillary GC with electron capture 
detection to yield a history of the synthetic stages used 
to prepare the compound selected. 

30 

While there is an almost unlimited set of chemical stages 
and methods which could be used to prepare combinatorial 
libraries of compounds, we will use coupling of a-amino 
acids to make a combinatorial library of peptides as an 
35 example of an application of the encoding methodology, in 
this example, we will describe preparation of a library of 
pentapeptides having all combinations of 16 different 
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amino acids at each of the five residue positions • Such 
a library would contain 16^ members. To uniquely encode 
all members of this library, 20 detachable tags (T1-T20) 
as described above would be required. 

5 

To prepare the encoded library, we would begin with a 
large niamber (>10^) of polymer beads of the type used for 
Merrifield solid phase synthesis and functional i zed by 
free amino groups. We would divide the beads into 16 

10 equal portions and place a portion in each of 16 different 
reaction vessels (one vessel for each different a-amino 
acid to be added) . We would then add a small portion 
(e.g., 1 mol%) of identifiers to each of the amino acid 
derivatives (e.g.^ Fmoc amino acids) to be coupled in the 

15 first stage of the combinatorial synthesis. The specific 
combination of the tags incorporated into the identifiers 
added would represent a simple binary code which 
identifies the amino acid used in the first stage of 
synthesis. The 16 amino acids added would be indicated by 

20 numbers 1-16 and any such number could be represented 
chemically by combinations of the first four tags (T1-T4). 
In tables 2 and 3, a typical encoding scheme is shown in 
which the presence or absence of a tag is indicated by a 
1 or a 0, respectively. The letter T may represent 

25 either the the tag or the identifier incorporating that 
tag. 
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Table 2, A typical encoding scheme. 

5 



Amino Acid added in first 
stage 


T4 


T3 


T2 


Tl 


Niimber 1 (e«g. , glycine 


0 


0 


0 


0 


1 Number 2 (e.g., alanine) 


0 


0 


0 


1 


Number 3 (e.g., valine) 


0 


0 


1 


0 


Number 4 (e.g., serine) 


0 


0 


1 


1 


Number 5 (e.g., threonine) 


0 


1 


0 


0 


« 










• 










Number 16 (e.g., tryptophan) 


1 


1 


1 


1 



We would then carry out a standard dicyclohexyl- 
carbodiimide (DOC) peptide coupling in each of the 16 
20 vessels using the Fmoc amino acids admixed with small 
amounts of the encoding activated identifiers as indicated 
above. During the couplings, the sunino acids as well as 
small amounts (e.g., 1%) of the identifiers would become 
chemically bound to intermediates attached to the beads. 

25 

Next the beads would be thoroughly mixed and again 
separated into 16 portions. Each portion would again be 
placed in a different reaction vessel. A second amino 
acid admixed with appropriate new activated identifiers 
30 (T5-T8) would be added to each vessel and DCC coupling 
would be carried out as before. The particular mixture of 
the incorporated tags (T5-T8) would again represent a 
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simple binary code for the amino acid added in this, the 
second stage of the combinatorial synthesis. 



5 Table 3. A typical encoding scheme. 





Amino Acid added in second 
stage 


T8 


T7 


T6 


T5 


10 


Number 1 (e,g,, glycine 


0 


0 


0 


0 




Number 2 (e.g., alanine) 


0 


0 


0 


1 




Number 3 (e.g., valine) 


0 


0 


1 


0 




Nximber 4 (e.g., serine) 


0 


0 


1 


1 




Number 5 (e.g., threonine ) 


0 


1 


0 


0 


15 


. 
























Niimber 16 (e.g., tryptophan) 


1 


1 


1 


1 j 



20 After the 16 couplings of stage 2 are complete, the beads 
would be again mixed and then divided into 16 new portions 
for the third stage of the synthesis. For the third 
stage, T9-T12 would be used to encode the third amino acid 
bound to the beads using the same scheme used for stages 

25 1 and 2. After the third couplings, the procedure would 
be repeated two more times using the fourth amino acids 
with T13-T16 and the fifth amino acids with T17-T20 to 
give the entire library of 1,048,576 different peptides 
bound -to beads. 

30 



Although the above beads would be visually 
indistinguishable, any bead may be chosen (e.g., by 
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selecting based on the interesting chemical or biological 
properties of its bound peptide or other target molecule) 
and its synthetic history may be learned by detaching and 
decoding the associated tags. 

5 

The precise method used to detach tags will depend upon 
the particular linker used to chemically bind it to 
intermediates in the combinatorial synthesis of the target 
compound. In the example above, the ortho-nitrobenzyl 
10 carbonate linkages, which are known to be unstable to 
-300 nm light (Ohtsuka, et al., J. Am. Chem. 5oc. , 100, 
8210 [1978]), would be cleaved by photochemical 
irradiation of the beads. The tags would then diffuse 
from the beads into free solution which would be injected 
15 into a capillary gas chromatograph (6C) ecpiipped with a 
sensitive electron capture detector. Since the order in 
which the tags (T1-T20) emerged from the GC and their 
retention times under standard conditions were previously 
determined, the presence or absence of any of T1-T20 would 
20 be directly determined by the presence or absence of their 
peaks in the GC chroma togram. If 1 and 0 represent the 
presence and absence respectively of peaks corresponding 
to T1-T20, then the chromatogram can be taken as a 
20-digit binary number which can uniquely represent each 
25 possible synthesis leading to each member of the peptide 
library. The use of halocarbon tags which are safe, 
economical and detectable at subpicomole levels by 
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electron capture detection makes this capillary GC method 
a particularly convenient encoding scheme for the purpose. 

As an example of using the encoding scheme for the 
5 pentapeptide library above, a particular bead is 
irradiated with light to detach the tags, the solubilized 
labels injected into a capillary GC and the following 
chromatogram obtained ("Peak" line): 

10 

Label 20 19 18 17 16 15 U 13 12 11 10 9 8 7 6 5 A 3 2 1 
Peak I I I I I II II 

15 Binary 1111 0100 0011 0001 0010 

Stage 5 A 3 2 1 

AA Tryptophan Threonine Serine Alanine Valine 

20 

The "Label" line diagrams the GC chromatogram where T20-T1 
peaks (J) are to be found (note the injection is given on 
the right and the chromatogram reads from right to left) . 

25 The "Peak" line represents the presence of labels (T20-T1) 
as peaks in the chromatogram. The "Binary" line gives 
presence (1) or absence (0) of peaks as a binary number. 
The "Stage" line breaks up the binary number into the five 
different parts encoding the five different stages in the 

30 synthesis. Finally, the "AA" line gives the identity of 
the amino acid which was added in each stage and was given 
by the binary code in the "Binary" line above. 



GC Inject 



35 
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EZAMPLE 2 

RADIO-LABELED TAGS 
In the next illustration, the tags employed are 
monome thy 1 ethers of linear alkyl-a^w-diols. The diol 
5 would have N + 2 carbon atoms, where N designates the 
stage. The methyl group would be a radiolabeled reagent 
which would have any of a variety of ^H/^^C ratios from 1/1 
to m/1, where m is the number of choices. The double 
radiolabel allows for accurate quantitation of the tritium 
10 present in the tag. By having 10 different alkylene 
groups and 10 different radioactive label ratios, 10^*^ 
unique ten-member sets of tags are generated. Tags would 
be attached by first reacting them with activating agents, 
e.g. phosgene to form a chlorof ormate, followed by 
15 reaction with the F^-F^ component- In this case, F^-F^ is 
the o-nitro-p-carboxy-benzyl alcohol protected as the t- 
butyl ester. Each time a synthetic stage is carried out, 
the de-esterified identifier is added directly to the 
bead, which has covalently bonded amine or hydroxyl 
20 groups, to form amides or esters with the acid activated 
using standard chemistry, e.g., carbodiimide coupling 
methodology. At the end of the sequential synthesis, the 
beads are then screened with a variety of receptors or 
enzymes to determine a particular characteristic. The 
25 beads demonstrating the characteristic may then be 
isolated, the tags detached and separated by HPLC to give 
a series of glycol monomethyl ethers which may then be 
analyzed for radioactivity by standard radioisotope 
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identification methods. For example, if the first and 
second tags to elute from the HPLC column had ^H/^^C ratios 
of 5:1 and 7:1 respectively, then the product which showed 
activity had been synthesized by reagent number 5 in 
5 stage 1 and reagent number 7 in stage 2, 

EXAMPLE 3 

2401 Peptide Library 
The identifiers employed were 2-nitro-4-carboxyben2yl, 

10 0-aryl substituted o-hydroxyalkyl carbonate, where alkyl 
was of from three to 12 carbon atoms and aryl was (A) 
pentachlorophenyl, (B) 2,4,6-trichlorophenyl, or (C) 2,6- 
dichloro-4-f luorophenyl- The tags are designated as NAr, 
wherein N is the number of methylene groups minus two and 

15 Ar is the aryl group. Thus, tag 2A has a butylene group 
bonded to the pentachlorophenyl through oxygen. The 
subject tags can be easily detected using electron capture 
gas chromatography at about 100 frool. 

20 In tbe subject analysis, the tagging molecules are 
arranged in their GC elution order. Thus the tag which is 
retained the longest on the GC column is designated Tl and 
is associated with the least significant bit in the binary 
synthesis code number, the next longest retained tag is 

25 called T2 representing the next least significant binary 
bit, and so on. Using an 0.2mM x 20M methylsilicone 
capillary GC column, eighteen well-resolved tags were 
obtained where Tl through T18 corresponded to lOA, 9A, 8A, 
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7A, 6A, 5A, 4A, 3A, SB, 2A, 5B, lA, 4B, 3B, 2B, IB, 2C, 
and IC, respectively. 



An encoded combinatorial library of 2401 peptides was 
5 prepared. This library had the amino acid sequence N- 
XXXXEEDLGGGG-bead, where the variable X residues were D, 
1, K, L, Q, or S (single letter code). The 4 glycines 
served as a spacer between the encoded amino acid sequence 
and the bead. The combinatorial library included the 

10 sequence HjN-KLISEEDL, part of the 10 amino acid epitope 
which is known to be bound by 9E10, a monoclonal antibody 
directed against the human C-myc gene product. For 
encoding this library, three binary bits were sufficient 
to represent the seven alternative reagents for each 

15 stage. The code was as follows: 001 = S; 010 = I; Oil = 
K; 100 = L; 101 Q; 110 = E; 111 = D. 



The library was synthesized by first preparing the 
constant segment of the library H^NEEDLGGGG-bead on 1.5 g 

20 of 50-90/Lt polystyrene synthesis beads functionalized with 
1.1 meq/g of aminomethyl groups using standard solid phase 
methods based on t. -butyl side-chain protection and Fmoc 
main chain protection (Stewart and Young, "Solid Phase 
Peptide Synthesis*', 2nd edition. Pierce Chemical Co., 

25 1984). After deprotecting the Fmoc groups with 
diethylamine, the beads were divided into seven 200 mg 
fractions and each fraction placed in a different 
Merrifield synthesis vessel mounted on a single wrist- 
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action shaker. The beads in the seven vessels were 
processed independently as follows (see Table 3-1) . The 
letter T in this example refers to the tag or to the 
identifier incorporating that tag, 

5 

TABLE 3-1 



Ves 
sal 
No. 


Step 1 


Step 2 


Step 3 


Step 4 


1 


1%T1 


Die, wash 


Finoc(tBu)S^ Anh. 


Wash 


2 


1%T2 


II 


FinocI , Anh . 


It 


3 


1%T1,T2 


ft 


Finoc(Boc)K, Anh. 


ft 


4 


1%T3 


II 


FmocL^ Anh. 


ti 


5 


1%T1,T3 


II 


Piaoc(trityl)Q, 
Anh. 


If 


6 


1%T2,T3 


II 


Fmoc ( t-butyl ) E , 
Anh. 


n 


7 


1%T1,T2,T3 


II 


Finoc(tBu)D, Anh. 


It 



In accordance with the above procedure a sufficient amount 
20 of the identifiers listed in step 1 were attached via 
their carboxylic acids using diisopropylcarbodiimide to 
tag about 1% of the free amino groups on each bead in the 
corresponding vessel. The remaining free amino groups on 
each bead were then coupled in step 3 to N-protected amino 
25 acid anhydrides. After washing with methylene chloride, 
isopropanol, and N,N-dimethylformamide, the beads from the 
seven vessels were combined and thoroughly mixed. At this 
point the library had seven members. 



I 
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After Fmoc deprotection (diethylamine) , the beads were 
again divided into seven vessels and processed as before 
except that in place of the identifiers used previously, 
identifiers representing the second stage {T4-6) were 
5 used* By repeating the procedure two more times, using 
identifiers T7-9 and then TlO-12 analogously, the entire 
uniquely encoded library of 7^=2401 different peptides was 
prepared using only 12 identifiers. 



10 To read the synthesis code from a single selected bead, 
the bead was first washed four times in a small centrifuge 
tube with 100 portions of DMF, and then resuspended in 
1 ML of DMF in a Pyrex capillary tube. After 2 hrs of 
photolysis with a Rayonet 350 nm light source, the tags 

15 released from the bound identifiers were silylated using 
about 0.1 /iL bis-trimethylsilylacetamide and the solution 
injected into a Hewlett Packard capillary gas 
chromatograph equipped with an 0.2mM x 20M methylsilicone 
fused silica capillary column and an electron capture 

20 detector. The binary synthesis code of the selected bead 
was directly determined from the chromatogram of the tags 
which resulted. 
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SnKPLE 4 

Benzodiazepine Library 
A combinatorial benzodiazepine library comprising 30 
compounds of the formula VIII 

5 



CI 



10 




wherein: 

R is CHj, CHCCHj)^, CH2CO2H, (CH2)^NH2. CH^C^H^OH, or CHjC^Hj 
and 

15 is H, CH3, C2H5, CH2CH=CH2, or CHgC^Hg 

is constructed per the following scheme. 
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STEP C 




® = POLTSryEESE RESIN 



0 HNFmoc 



STEP D 



1) TAGS Kg.j 



2) 20% PIPERIDIHE IN DMF 
R ■ 

X 

FmocN CO 
» F 



3) 



0 




3) 5 ACOH/DJIF 

0 

60 C 
STEP E 
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STEP E 




STEP F 



1) LITHIATED 5(PIEmilETBYL)-2- 
OXAZOLIDINOSi; 

THF, -78 C 

2) R^X, DttF 

X=BROttINE OE IODINE 

3) TFAjHgOlDIttETHYLSULFIDE 

95:5:10 




STEP G 



hv (350 M) 
W 
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The benzodiazepines VIII are constructed on polystyrene 
beads similarly to the method of Bunin and Ellman (JACS, 
114. 10997-10998 [1992]) except that a photolabile linker 
5 is incorporated between the bead and the benzodiazepine 
(see steps A, B, and C) , thus allowing the benzodiazepine 
to be removed in step G non-hydrolytically by exposure to 
U.V. light (350 nm in DMF for 10 minutes to 12 hr) . 
Additionally, binary codes are introduced in steps D and 
10 E which allow for a precise determination of the reaction 
sequence used to introduce each of the 6 R's and 5 R^'s. 
After removal of the tags according to step H and analysis 
by electron capture detection following GC separation, the 
nature of the individual R and R^ groups is determined. 

15 

Steps E, and F essentially follow the procedure of 
Bunin and Ellman, but also include the incorporation of 
identifiers IXa-c in step D and IXd-f in Step E. The 
identifiers are all represented by Formula IX, 

20 



25 
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wherein: 

IX^ indicates n=6; 
IXjj indicates n=5; 
IXj. indicates n=4 ; 
5 IX^ indicates n-3; 

IX^ indicates n=2; and 
IX^ indicates n=l* 

The codes for each of R and are as follows: 
10 Table 4-1 



1 — 


E 


a 


CH3 


b 


CH(CH3)2 


a,b 


CHgCOjH 


c 


(CH2)^NH2 


a^c 


CH2-C^H^-4-OH 


hfC 


CHjC^Hj 


IX 


El 


d 


H 


e 


CH3 


d,e 




f 


CH2CH-CH2 


d,f 
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gtep A 

To a solution of I (1 ecjuiv) in toluene (cone. = 0.5 M) is 
added the Fmoc protected 2 -amino-5-chloro-4 '-hydroxy- 
benzophenone (1*3 eq)and diethylazaodicarboxylate (1.3 eq) 
5 and triphenylphosphine (1.3 eq) . The mixture is stirred 
at room temperature for 24 hr. The solvent is removed in 
vacuo and the residue triturated with ether and filtered 
and the solvent again removed in vacuo . The resultant 
product II is purified by chromatography on silica gel. 

10 

Step B 

To a solution of II in DCM (0.2 M) stirring at r.t. is 
added TFA (3 eguiv.) and the solution is allowed to stir 
for 12 hr. The solution is evaporated to dryness in vacuo 
15 and the residue dissolved in DCM, washed once with brine 
and dried (Na^SO^) . Filtration and evaporation of the 
solvent affords III. 



Step C 

20 1% DVB (divinylbenzene) cross-linked polystyrene beads 
(50/i) functionalized with aminomethyl groups (l.l mEq/g) 
are suspended in DMF in a peptide reaction vessel 
(Merrifield vessel) . Ill (2 equiv) and HOBt (3 equiv) in 
DMF are added and the vessel shaJcen for 10 min. DIC (3 eq) 

25 is added and the vessel is shaken until a negative 
Ninhydrin test indicates completion of the reaction after 
12 hr. 
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The DMF is removed and the resin washed with additional 
DMF (x5) and DCM (x5) before drying in vacuo . 



5 Step D 

The dry resin is divided into 6 reaction vessels and is 
suspended in DCM* The appropriate combinations of 
identifiers IX^,^ (see Table 4-1) are added to the flasks 
and shaken for 1 hr. The RhCTFA)^ catalyst (1 inol%) is 

10 added to each flask and shaken for an additional 2 hr. 
The flasks are drained and the resin washed with DCM (x5) . 
The resin is then treated with a solution of TFA in DCM 
(0.01 M) and shaken for 30 min. and then washed again with 
DCM (x3) followed by DMF (x2) . The resin is treated with 

15 a 20% solution of piperidine in DMF and shaken for 30 min. 
and is then washed with DMF (x3) and DCM (x3). 



To each flask is added the appropriate Fmoc protected 
amino acylfluoride (3 equiv) (when required side-chain 

20 functional groups are protected as tert -butyl ester (Asp) , 
tert -butyl ether (Tyr) or tert -butyloxvcarbonyl (Lys)) 
with 2,6-di-tert-butyl-4-methylpyridine (10 equiv) and the 
flasks shaken overnight or until a negative Ninhydrin test 
is achieved. The resin is washed once (DCM) and then the 

25 six batches are combined and washed again (DCM, x5) before 
drying in vacuo . 



wo 94/08051 PCT/US93/09345 

-87- 

Step E 

The dry resin is divided into five reaction vessels and is 
suspended in DCM- The appropriate combinations of 
identifiers IX^.^ (see Table 4-1) are added to the flasks 
5 and shaken for 1 hr. The Rh(TFA)2 catalyst (1 mol%) is 
added to each flask and shaken for an additional 2 hr. 
The flasks are drained and the resin washed with DCM {x5) • 
The resin in then treated with a solution of TFA in DCM 
(0,01 M) and shaken for 30 min, and is then washed with 
10 DMF (x3) and DCM (x3) • 

To each flask is added a solution of 5% acetic acid in DMF 
and the mixtures are heated to 60 'C and shaken overnight. 
The solvent is drained and then the resin washed with DMF 
15 (x5) . 



Step F 

Each batch of resin is suspended in THF and the flasks are 
cooled to -78 'C- To each flask is added a solution of 

20 lithiated 5-(phenylinethyl)-2-oxa2olidinone (2 equiv) in 
THF and the mixtures are shaken at -78 'C fori hr. The 
appropriate alkylating agent (Table 4-2) (4 equiv) is then 
added to each reaction flask followed by a catalytic 
amount of DMF. The vessels are allowed to warm to ambient 

25 temperature and shaken at this temperature for 5 hrs. The 
solvent is removed by filtration and the resin washed with 
THF (xl) and then dried in vacuo . The batches of resin 
are then combined and washed with THF (x2) and DCM (x2) 
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and the combined resin is then treated with a 95:5:10 

mixture of TFA: water rdimethylsulphide for 2 hrs to remove 
the side chain protecting groups. 

TABLE 4-2 



IDENTIFIER 


ALKYLATING 
AGENT 


e 


HjCI 




CjHjBr 


1 f 


BrCH2-CH=CH2 


d,f 


BrCHjCjHj 



10 

step G 

The resultant benzodiazepine can be cleaved from a bead of 
polystyrene by suspending the bead in DMF and irradiating 
with U-V. (350 nm) for 12 hrs. 

15 

Step H 

A bead of interest is placed into a glass capillary tube. 
Into the tube is syringed 1 of IM aqueous cerium (IV) 
ammonium nitrate (CAN) solution, 1 /iL of acetonitrile and 
20 2mL of hexane. The tube is flame sealed and then 
centrifuged to ensure that the bead is immersed in the 
reagents. The tube is placed in an ultrasonic bath and 
sonicated from 1 to 10 hrs preferably from 2 to 6 hrs. 
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The tube is cracked open and -1 fiL of the upper hexane 
layer is mixed with mL of bis{triinethylsilyl) - 

acetamide (BSA) prior to injection into the GO and each 
tag member determined using electron capture detection, as 
5 exemplified in the following scheme. 



-90- 
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EZAMPLE 5 

117,649 Peptide Library 
An encoded library of 117,649 peptides was prepared^ This 
library had the sequence HjN-XXXXXXEEDLGGGG-bead, where the 
5 variable residue X was D,E,I,K,L,Q or S. This library was 
encoded using the 18 tags as defined in Example 3; three 
binary bits being sufficient to represent the seven amino 
acids used in each step. The code was: 001=S; 010=1; 
011=:=K; 100=L? lOl^Q; 110==E; and lll^D, where 1 indicates 
10 the presence and 0 indicates the absence of a tag. 

The constant segment of the library (H^NEEDLGGGG-bead) was 
synthesized on 1.5 g of 50-80 ^ Merrifield polystyrene 
synthesis beads functionalized with 1.1 mEc[/g of 
aminomethyl groups using standard solid phase methods 
based on t-Bu sidechain protection and Fmoc mainchain 
protection. After deprotecting the N-terminal Fmoc 
protecting group with diethylamine, the beads were divided 
into seven 200 mg portions, each portion being placed into 
a different Merrifield synthesis vessel mounted on a 
single wrist-action shaker. 

The beads in the seven vessels were processed as in Table 
3-1 to attach the sets of identifiers (T1-T3) and the 
25 corresponding amino acid to each portion except that 
instead of DIC, i-butylchloroformate was used for 
activation. 



15 



20 



wo 94/08051 



PCr/US93/09345 



-92- 

This procedure first chemically attached small amounts of 
appropriate identifiers via their carboxylic acids to the 
synthesis beads. This attachment was achieved by 
activating the linker carboxyl groups as mixed carbonic 
5 anhydrides using iso butvlchlorof ormate , and then adding an 
amount of activated identifier corresponding to l% of the 
free amino groups attached to the beads. Thus, about 1% 
of the free amino groups were terminated for each 
identifier added. The remaining free amino groups were 
10 then coupled in the usual way with the corresponding 
protected amino acids activated as their symmetrical 
anhydrides. 



After washing, the seven portions were combined and the 
15 Fmoc protected amino groups were deprotected by treatment 
with diethylamine. The beads were again divided into 
seven portions and processed as before, except that the 
appropriate identifiers carrying tags T4, T5, and T6 were 
added to the reaction vessels. 
20 The procedure of dividing, labelling, coupling the amino 
acid combining and main-chain deprotection was carried out 
a total of six times using identifiers bearing tags Tl- 
T18, affording an encoded peptide library of 117,649 
different members. 



25 
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Typical Identifi er Preparation 

To a solution of 8-broino-l-octanol (0.91 g, 4.35 minol) and 
2 , 4 , 6-trichlorophenol (1.03 g, 5.22 imol) in DMF (5 ©L) 
was added cesiim carbonate (1.70 g, 5.22 mstol) resulting 
5 in the evolution of gas and the precipitation of a white 
solid. The reaction was stirred at 80* C for 2 hrs. The 
mixture was diluted with toluene (50 mL) and poured into 
a separatory funnel, washed with 0.5 N NaOH (2x50 mL) , IN 
HCl (2x50 nL) and water (50 nL) and the organic phase was 
10 dried (MgSO^) . Removal of the solvent by evaporation gave 
1.24 g (87% yield) of tag as a clear oil. 



The above tag (0.81 g, 2.5 wboI) was added to a 2 M 
solution of phosgene in toluene (15 mL) and stirred at 

15 room temperature for 1 hr. The excess phosgene and the 
toluene were removed by evaporation and the resulting 
crude chloroformate was dissolved in DCM (5 mL) and 
pyridine (0.61 mL, 7.5 mmol). tert-Butyl 4-hydroxy- 
inethyl-3-nitroben2oate (Barany and Albericio, J. Am. Chem. 

20 Soc, (1985), 107 . 4936-4942) (0.5 g, 1.98 mmol) was added 
and the reaction mixture stirred at room temperature for 
3 hrs. The solution was diluted with ethyl acetate (75 
mL) and poured into a separatory funnel. After washing 
with IN HCl (3x35 mL) , saturated NaHCOj (2x35 mL) and brine 

25 (35 mL) , the organic phase was dried (MgSO^) . The solvent 
was removed by evaporation and the residue purified by 
chromatography on silica gel (5% to 7.5% ethyl acetate in 
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petroleum ether) affording 0.95 g (79% yield) of the 
identifier tert-butyl ester as a clear oil* 

Trifluoroacetic acid (3 mL) was added to a solution of the 
5 identifier tert-butyl ester (0.95 g, 1.57 sixnol) in DCM (30 
mL) to deprotect the linker acid (i.e., F^-F^ of Formula I) 
and the solution was stirred at room temperature for 7 
hrs. The mixture was then evaporated to dryness and the 
residue redissolved in DCM (30 mL) . The solution was 
10 washed with brine (20 mL) and the organic phase dried 
(MgSO^) . Removal of the solvent by evaporation gave 0.75 
g (87% yield) of the identifier (6B) as a pale yellow 
solid. (Tag nomenclature is the same as in Example 3) . 

15 Typical Encoded Librarv Synthesis Step 

Na-Fmoc-E (tBu) -E (tBu) -D(tBu) -L-G4-NH-resin was suspended 
in DHF (20 mL) and shaken for 2 min. After filtering, 1:1 
diethylamine:DMF (40 mL) was added to remove the Fmoc 
protecting groups and the resin was shaken for 1 hr. The 

20 resin was separated by filtration and washed with DMF 
• (2x20 mL, 2 min each); 2:1 dioxane: water (2x20 mL, 5 min 
each), DMF (3x20 mL, 2 min each), DCM (3 x 20 mL, 2 min 
each) then dried in vacuo at 25" C. (The resin was found 
to have 0.4 mmol/g amino groups by picric acid titration 

25 at this stage.) 

150 mg Portions of the resin were placed into seven 
Merrifield vessels and suspended in DCM (5 mL) . The 
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appropriate identifiers were activated as their acyl 
carbonates as follows (for the first coupling): Tl (6.6 
mg, 0.0098 minol) was dissolved in anhydrous ether (2 mL) 
and pyridine (10 ^L) was added. Isobutyl chloroformate 
5 (1.3 ML, 0.0096 nmol) was added as a solution in anhydrous 
ether (0.1 mL) . The resulting mixture was stirred at 25' 
C for 1 hr. during which time a fine white precipitate 
formed. The stirring was stopped and the precipitate was 
allowed to settle for 30 min. Solutions of the 
10 acylcarbonates of T2 and T3 were prepared in the same way. 
Aliquots (0.25 mL) of the supernatant solution of 
activated identifiers were mixed to give the appropriate 
3-bit binary tag codes and the appropriate coding; mixtures 
of identifiers were added to each of the seven synthesis 
15 vessels. The vessels were shaken in the dark for 12 hrs, 
and then each was washed with DCH (4x10 mL, 2 min each) . 
A solution of the symmetrical anhydride of an Na-Fmoc 
amino acid in DCM (3 equivalents in 10 mL) was then added 
to the corresponding coded batch of resin and shaken for 
20 20 min. 5% N,N-diisopropylethylamine in DCM (1 mL) was 
added and the mixture shaken until the resin gave a 
negative Kaiser test. 

The resin batches were filtered and combined, and then 
25 washed with DCM (4x20 mL, 2 min each) , isopropanol (2x20 
mL, 2 min each), DCM (4x20 mL, 2 min each). The next 
cycle of labelling/coupling was initiated by Fmoc 
deprotection as described above. 
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After Fmoc deprotection of the residues in the last 
position of the peptide, the side chain functionality was 
deprotected by suspending the resin in DCM (10 mL) , adding 
thioanisole (2 mL) , ethanedithiol (0.5 mL) and tri- 
5 fluoroacetic acid (10 mL) then shaking for 1 hr at 25* c. 
The resin was then washed with DCM (6x20 mL, 2 min each) 
and dried. 

Electron Capture Gas Chromatography Reading of Code 
10 A single, selected bead was placed in a Pyrex capillary 
tube and washed with DMF (5x10 ^L) . The bead was then 
suspended in DMF (1 mL) and the capillary was sealed. The 
suspended bead was irradiated at 366 nm for 3 hrs to 
release the tag alcohols, and the capillary tube 

15 subsequently placed in a sand bath at 90* c for 2 hrs. 
The tube was opened and bis-trimethylsilyl acetamide (0.1 
mL) was added to trimethylsilylate the tag alcohols. 
After centrifuging for 2 min., the tag solution above the 
bead (1 ^L) was injected directly into an electron capture 

20 detection, capillary gas chromatograph for analysis. Gas 
chromatography was performed using a Hewlett Packard 
Series II Model 5890 gas chromatograph equipped with a 0.2 
mmx20 m methyl silicone fused silica capillary column and 
an electron capture detector. Photolysis reactions were 

25 performed using a UVP "Black Ray" model UVL 56 hand-held 
366 nm lamp. 



wo 94/08051 PCr/US93/09345 

-97- 

Antibodv Affinity Methods 

The anti-C-myc peptide monoclonal antibody 9E10 was 
prepared from ascites fluid as described in Evans et al. , 
Mol. Cell Biol., 5, 3610-3616 (1985) and Munro and Pelham, 
5 Cell, 4S, 899-907 (1987). To test beads for binding to 
9E10, beads were incubated in TBST [20 wfL Tris-HCl (pH 
7.5), 500 mM NaCl and 0.05% Tween-20] containing 1% bovine 
serum albumin (BSA) to block non-specific protein binding 
sites. The beads were then centrifuged, resuspended in a 
10 1:200 dilution of 9E10 ascites fluid in TBST + 1% BSA and 
incubated overnight at 4*C. Beads were subsequently 
washed three times in TBST and incubated for 90 min. at 
room temperature in alkaline phosphatase-coupled goat 
antimouse IgG antibodies (Bio-Rad Laboratories) , diluted 
15 1:3000 in TBST + 1% BSA. After washing the beads twice in 
TBST and once in phosphatase buffer (100 mM Tris-HCl, pH 
9.5, 100 mM NaCl and 5 mM MgCl^) , the beads were incubated 
1 hr at room temperature in phosphatase buffer containing 
one one-hundreth part each of AP Color Reagents A & B 
20 (Bio-Rad Laboratories) . To stop the reaction, the beads 
were washed twice in 20 mM sodium EDTA, pH 7.4. Solution 
phase affinities between 9E10 and various peptides were 
determined by a modification of the competitive ELISA 
assay described by Harlow et al. . Antibodies: a Laboratory 
25 Manual, 570-573, Cold Spring Harbor Press, Cold Spring 
Harbor, N.Y. 
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From a 30 mg sample of the combinatorial library of 
peptides, 40 individual beads were identified which 
stained on exposure to the anti-C-myc monoclonal antibody* 
Decoding of these positive-reacting beads established the 
5 ligand's reaction sequence as the myc epitope (EQKLISEEDL) 
or sequences that differed by one or two substituents 
among the three N-terminal residues. 

EXAMPLE 6 

10 23.540,625 Mixed Amide Library 

The encoding technicpie was tested further by the 
preparation of a coxabinatorial library of 23,540,625 
members consisting of peptides and other amide compounds. 

15 The synthesis was carried out using 15 different reagents 
in 5 steps and- 31 different reagents in the sixth step. 
Four identifiers were used to encode each of the 5 steps 
with 15 reagents and five identifiers were used in the 
final step with 31 reagents. A label set of 25 

20 identifiers was therefore prepared. 2-Nitro-4- 
carboxybenzyl, 0-aryl substituted w-hydroxyalkyl carbonate 
identifiers were employed, where the tag components were 
comprised of an alkyl moiety of from 3 to 12 carbon atoms 
and the aryl moieties were (A) pentachlorophenyl, (B) 
25 2, 4, 5-trichlorophenyl, (C) 2,4,6-trichlorophenyl, or (D) 
2,6-dichloro-4-fluorophenyl. A set of 25 tags was 
prepared using appropriate alkyl chains lengths with A, B, 
C or D, separable using a 0.2 mMx25M methyls ilicone GC 
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column. The chemical compositions of tags T1-T25 (where 
Tl represents the tag with the longest retention time, and 
T25 the tag with the shortest retention time) are 
summarized below: 



Tl 


lOA 


T6 


IOC 


Til 


7B 


T16 


50 


T21 


2B 


T2 


9A 


T7 


9B 


T12 


7C 


T17 


4B 


T22 


2C 


T3 


8A 


T8 


9C 


T13 


6B 


T18 


4C 


T23 


IB 


T4 


7A 


T9 


83 


T14 


6C 


T19 


3B 


T24 


IC 


T5 


lOB 


TIO 


8C 


T15 


5B 


T20 


3C 


T25 


2D 



The designations lOA^ 9A, etc. are as described in Example 



15 



The fifteen reagents used in the first five stages and the 
code identifying them are represented below where 1 
represents the presence of tag and O the absence thereof. 
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REAGENT 


CODE 


L-serine 


(0001) 


D-serine 


(0010) 


L-glutamic acid 


(0011) 


D-glutamic acid 


(0100) 


L-glutasiine 


(0101) 


D-glutamine 


(0110) 


L-lysine 


(0111) 


D-lysine 


(1000) 


L-Proline 


(1001) 


D-Proline 


(1010) 


L-phenylalanine 


(1011) 


D-pheny 1 a 1 anine 


(1100) 


3 -amino-benzoic 
acid 


(1101) 


4 -aminopheny 1 
acetic acid 


(1110) 


3 , S-diamino- 
benzoic acid 


(1111) 
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The 31 reagents and the code representing them in the 
sixth stage are represented belov: 



REAGENT 


CODE 


Ti— sei*inG 


f 00001) 


*^ V> A, ^* A * V> 


roooioi 

^ W W W ^ W J 


^ : 

y Tj— cf 111 tain ie acid 


^ v/v/ VXX / 




\ \J vj X »J w y 


xj ^ j» w»* m X i ic 


/00101 \ 




\ wwX X V / 




\ Vs/XXX j 




f m nnn \ 
( UXUUU ) 




\ uxuwx / 




m n) 

\ 1#X wXU y 


pxi eriy i o. i a. n X n t= 


^ UXUxX / 


U U {^nciljfXaXCIIlXlXC 


l^XXUU/ H 


3*— ainino^benzolc dcid 


(01101) 


j 4-aininophenyl acetic acid 


(OHIO) 


1 3 , S-diamino-benzoic acid 


(01111) 


Succinic Anhydride 


(10000) 


Tiglic acid 


(10001) 


2-pyrazine carboxylic acid 


(10010) 
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(±)thioctic acid 


(10011) 


1-piperidinepropionic acid 


(10100) 


1 piperonylic acid 


(10101) 


e-methylnicotinic acid 


(10110) 1 


3- (2-thienyl) acrylic acid 


(10111) 1 


methyl iodide 


(11000) 


tosyl chloride 


(11001) 


p-toluenesulfonyl isocyanate 


(11010) 


1 3-cyanobenzoic acid 


(11011) 


1 phthallic anhydride 


(11100) 


acetic anhydride 


(11101) 


ethyl chloroformate 


(11110) 


mesy 1 chl or ide 


(11111) 



15 A spacer of six glycine units was prepared on the 

beads using standard methods. The variable region was 
constructed using butyl sidechain protection, and amino 
groups were protected as Fmoc derivatives. Amide bonds 
were formed by activation of the carboxylic acid with DIG 

20 and HOBt* 
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EXMIPLE 7 

Hetero-Diels-Alder Library 
A combinatorial hetero Diels-Alder library comprising 42 
compounds of the formula: 

5 




wherein ; 

is H, CH3O, F3C, F3CO, HjC^O, or C^H^^; 
is H, CH3, or CH3O; 
R^ is H (when n=2), or CH3 (when n=l) ; and 

15 
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was constructed per the following scheme: 
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IV 



1) Toluene 

A 



2) Identifiers 




R 

STEP D 




STEP E 
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STEP I 




bv (350 m) 
DMT 
STEP r 




STEP G Ce(NH4)2(K03)6 




HO 
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The azatricyclic products (VI) were constructed on 
polystyrene beads and were linked to the beads by a 
photocleavable linker allowing the azatricycle {VII) to be 
5 removed from the bead by exposure to U.V. light (350 nm in 
DMF) . The binary codes introduced in steps C,D and E 
allow a unique deteimiination of the reaction sequence used 
to introduce ArR, r\ and R-'. The encoding tags were 
removed according to step G and analyzed by electron 
10 capture detection following GC separation. 

The identifiers used in this scheme are represented by the 
formula X: 



15 




Wherein; 
25 X3 indicates n=10 
indicates n=9 
indicates n=8 
indicates n=7 
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indicates n=6 



X. indicates n=5 



Xg indicates n=4 



5 The codes for each of R, r\ R^, R^ are as follows: 

TABLE 7-1 



10 



Ar = 




R = K 



15 b 



Ar = 




E = CI 



20 a,b 



Ar = 




R^*=H r2=H 



25 d 



r1«H r2=CH, 



d,c 



R^=0CH3 R^^OCHj 
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10 



e 



R^=CF3 R^H 



B,C R^=C^H50 r2=H 

5 e,d R^=F3C0 r2=H 



R^=CH3 n«l 



R^=H n=2 



Step A 

15 To a solution of I (2.03 g, 8 annol) , 4 -hydroxybenz aldehyde 
(1.17 g, 9.6 iniaol) and triphenylphosphine (2.73 10.4 
minol) in toluene (20 mL) stirring at 0*C was added over a 
period of 30 minutes diethylazodicarboxylate. The 
solution was allowed to warm and stirred for 1 hour once 

20 ambient temperature had been reached. The solution was 
concentrated by removal of approximately half of the 
solvent in vacuo and was then triturated with ether. The 
mixture was then filtered and the residue was washed 
thoroughly with ether. The solvent was removed in vacuo 

25 and the residue was purified by chromatography on silica 
gel (15% ethyl acetate in hexane) affording 1.3 g of the 
ether Ila (47% yield) . 
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2-chloro-4-hydroxybenzaldehyde and 2 -hydroxy- 1- 
naphthaldehyde were coupled to I in analogous fashion 
affording ethers lib and c in yields of 91% and 67%, 
respectively, 

5 

Step B 

To a solution of ether Ila (0.407 g, 1,14 mmol) in DCM (20 
mL) stirring at room temperature was added TFA (8 mL) . 
The solution was allowed to stir for 6 hrs. The solution 
10 was evaporated to dryness in vacuo affording 0.343 g of 
acid Ilia (100% yield) . Ethers lib and lie were 
deprotected analogously affording acids Illb and c in 
yields of 92% and 100% respectively. 

15 step c 

Into a peptide reaction vessel (Merrifield vessel) were 
measured 1% DVB (divinylbenzene) cross-linked polystyrene 
beads (SO-SOju) functional i zed with aminomethyl groups (l.l 
rosq/g) (200 mg of resin) • The resin was suspended in DMF 

20 (2 mL) and shaken for 20 min. The acid Ilia (38 mg, 2 
equiv.), l-hydroxybenzotriazole (40 mg, 2 equiv) and 
diisopropylcarbodiimide (38 mg, 2 equiv) were added and 
the mixture shaken until a negative Ninhydrin test was 
achieved (22 hr) . The solution was removed by filtration 

25 and the resin was washed with DCM (8x 10 mL) . 

The resin was resuspended in DCM (5 mL) , identifier Xa (15 
mg) was added and the flask was shaken for 1 hr. Rh(TFA)2 
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catalyst (1 mol%) was added and the flasks shaken for 2 
hrs* The solvent was removed by filtration and the resin 
resuspended in DCM (5 mL) . Trif luoroacetic acid (1 drop) 
was added and the vessel shaken for 20 min. The solvent 
5 was removed by filtration, and the resin was washed with 
DCM (8x 10 inL). 

In an analogous fashion, acids Illb and IIIc were attached 
to the resin and were encoded with the appropriate 
10 identifiers, i.e., Xb for acid Illb and Xa and Xb for acid 
IIIc. The three batches of resin were combined, mixed, 
washed, and dried. 

Step D 

The dry resin was divided into 7 equal portions (87 mg) 
which were put into seven peptide reaction vessels 
(Merrifield vessels) which were wrapped with heat tape. 
The resin in each vessel was suspended in toluene (10 mL) 
and shaken for 20 min. An appropriate amount of one 
aniline was then added to each flask (see Table 7-2). 



15 



20 
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TABLE 7-2 



FLASK 


ANILINE 


AMOUNT ADDED 


1 ^ 


Aniline 


3 ML 


2 


3 , 5-dimethylaniline 


3 loL 


3 


3,4, 5-trimethoxyaniline 


2 g 


4 


4 - t r i f luor ome thy 1 anil ine 


3 niL 


5 


4 -phenoxyanil ine 


2 g 


6 


4 -tr if luoromethoxy ani 1 ine 


3 mL 


7 


4 -cyclohexyl anil ine 


2 g 



10 

The heating tape was connected and the reaction 
mixtures shaken at 70 'C for 18 hrs. The heat tape was 
disconnected and the solvent was removed by filtration and 
each batch of resin was washed with dry DCM (4x 10 mL) , 

15 ether (10 mL) , toluene (10 mL) and DCM (2x 10 mL) . Each 
of the portions was then suspended in DCM (5 mL) and to 
each flask was added the appropriate identifier or 
combination of identifiers (Xc-e) (15 mg) (see Table 7-1) . 
The flasks were shaken for 1 hr. and then Rh(TFA)2 (1 mol%) 

20 was added to each flask and shaking continued for 2 hrs- 

The solvent was then removed and each batch of resin was 
re-suspended in DCM (5 mL) to which was added TFA (1 
drop) • This mixture was shaken for 20 min. , then the 
25 solvent was removed by filtration. The batches of resin 
were then washed (DCM, Ix 10 mL) and combined, washed 
again with DCM (3x 10 mL) and then dried thoroughly jj\ 
vacuo . 

30 Step E 

The dried resin was divided into two equal portions (0.3 
g) and each was placed in a peptide reaction vessel. The 
resin batches were washed with DCM (2x 10 mL) and then 
resuspended in DCM (5 mL) • To one flask was added the 
35 identifier Xf (15 mg) and to the other was added Xg (15 
mg) . The flasks were shaken for 1 hr. prior to the 
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addition of Rh(TFA)2 catalyst (1 inol%) . The flasks were 
shaken for 2 hrs. and then the solvent was removed by 
filtration* Each batch of resin was washed with DCM (3x 
10 mL) , and each was then resuspended in DCM (5 inL) . 

5 

The appropriate enol ether (1 mL) (see Table 7-1) was added 
to the flasks and the vessels shaken for 30 min. To each 
flask was added a solution of BF3-OEt2 (0.5 mL of a 5% 
solution in DCM) and the flasks were shaken for 24 hrs. 
10 Removal of the solvent by filtration was followed by 
washing of the resin with DCM (10 mL) and the resin was 
then combined. The beads were then washed further with 
DCM (5x 10 mL) , DMF (2x 10 mL) methanol (2x 10 mL) and DCM 
(2x 10 mL) • The resin was then dried thoroughly in vacuo . 

15 

Step F 

To confirm the identity of the products produced in the 
Hetero-Diels-Alder library one example was completed on a 
large scale to allow confirmation of the structure by 

20 spectroscopic means. The procedure followed was 
essentially the same method as described for the 
combinatorial library. In step A 4-hydroxybenzaldehyde 
was coupled to the photolabile group. In step D, aniline 
was condensed with the aldehyde- In step E, the enol 

25 ether was formed with 4 , 5-dihydro-2-methylfuran. 

The photolysis of the compound (step F) was performed by 
suspending 100 mg of the beads in DMF (0.3 mL) and 
irradiating the beads with UVP "Black Ray" model UVL 56 

30 hand-held 366 nm lamp for 16 hrs. The DMF was removed to 
one side by pipette and the beads rinsed with additional 
DMF (2x 3 BiL) . The original solution and the washings 
were combined and the solvent removed in vacuo . NMR 
analysis of the reaction mixture showed it to contain the 

35 desired azatricycle by comparison to the authentic sample. 
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Step G 

A bead of interest was placed into a pyrex glass capillary 
tube sealed at one end* A solution (1 /iL) of IM aqueous 
5 cerium (IV) ammonium nitrate and acetonitrile (1:1) was 
syringed into the tube, and the tube was then centrifuged 
so that the bead lay on the bottom of the capillary and 
was completely immersed by the reagent solution* Hexane 
(2 ML) was added by syringe and the tube was again 

10 centrifuged. The open end of the capillary was flame- 
sealed and placed in an ultrasonic bath for 4 hrs* The 
capillary was then placed inverted into a centrifuge and 
spun such that the aqueous layer was forced through the 
hexane layer to the bottom of the tube* This extraction 

15 process was repeated 3 or 4 times and the tube was then 
opened. The hexane layer (1.5 /xL) was removed by syringe 
and placed into a different capillary containing BSA (0.2 
ML) « This tube was sealed and centrifuged until the 
reagents were thoroughly mixed. A portion of the solution 

20 (ca. 1 mL) was removed and injected into a gas 
chromatography machine with a 25M x 0.2 mM methyl silicone 
fused silica column with electron capture detection for 
separation and interpretation of the tag molecules. 

25 The sample was injected onto the GO column at 200*0 and 25 
psi of carrier gas (Heg) . After 1 minute the temperature 
was increased at a rate of 20 'C per minute to 320* C, and 
the pressure was increased at a rate of 2 psi per minute 
to 40 psi. These conditions are shown in the following 

30 diagram: 
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GC CONDITIOHS 



5 

TEMPERATUBE 




320 C 



1 Din 




25 The following results were obtained with four randomly 
selected beads: 

Bead 1 







TAG DETECTED 






Xf 


Xe Xd Xc 


Xb Xa 


Ar 






2 -Hydroxy naphthyl 


r1 












H 






CH, (n=l) 
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Bead 2 





TAG DETECTED 




Xg 


Xe Xd Xc 


Xb 


Ar 






2 -chloro-4 -hydroxy-phenyl 














H 




r3 


H (n=2) 











Bead 3 








TAG DETECTED 






Xg 


Xe Xd 


Xb Xa 


Ar 






2 -Hydroxy naphthyl 






F,CO 








H 




r3 


H (n«2) 











Bead 


4 




TAG DETECTED 




Xf 


Xe Xd 


Xb 


Ar 






2 -chloro-4 -hydroxyphenyl 






F,CO 




r2 




H 




r3 


CH, (n=l) 







wo 94/08051 PCr/US93/09345 

-117- 

EXAMPLE 8 

Benzodiazepine Library 

Following the procedure of Example 4, a combinatorial 
5 library is constructed of the Formula X 



10 



15 




20 R is a radical of a naturally occurring D or L amino acid; 

is H, C^-C^ alkyl, lower alkenyl, C,-C^ alkylamine, 

carboxy C^-C^ alkyl, or phenyl C^-C^ alkyl wherein the 

phenyl is optionally substituted by lower alkyl, F, CI, 

Br, OH, NH^r CO2H, or 0-lower alkyl; 
25 r2 is H or CO^; 

R^ is H or OH; 

R^ is H or CI; 

with the provisos that when r' is OH, R^ is H and when R^ 
is carboxy, R^ is 

30 
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This library is released from a plurality of encoded beads 
of the general formula 



10 




15 wherein 

IX^ is a plurality of identifiers of the Formula la wherein 

said plurality represents an encoded scheme; 
S is a substrate; 

pi/_p2 ^YiB residue of the linker member of Formula la; 
20 and 

r\ R^, and R^ are as defined for Formula X. 
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EXAMPLE 9 

Typical Identifier Preparations 
The diazo compound identifiers which are attached to the 
resin via carbene formation are prepared as exemplified. 

5 

Compounds of the general formula 



10 




15 

wherein 

n is O-IO and 

Ar is pentachlorophenol , 2 , 4 , 6-tr ichlorophenol , 
20 2,4, 5-trichlorophenol , or 2 , 6-dichloro-4-f luorophenol 

are prepared as follows. 

To a solution of l-hydroxy-4- (2 , 6-dichloro-4-f luoro- 
phenoxy) butane (0.38 g, 1.5 mmol) , methyl isovanillate 

25 (0.228 q, 1.5 mmol) and triphenylphosphine (0.393 g, 1.5 
mmol) in THF (8 mL) was added diethylazodicarboxylate 
(0.287 g, 1.7 mmol). The solution stirred at r.t. for 36 
hrs. The solvent was removed in vacuo and the residue 
purified by chromatography on silica gel (with a mixture 

30 of 20% ethyl acetate and 80% petroleum ether) affording 
0.45 g of the aldehyde (77% yield). 

The aldehyde (100 mg, 0.26 mmol) was dissolved in acetone 
(8 mL) and was treated with a solution of KMnO^ (61 mg, 
35 0.39 mmol) in acetone (4 mL) and water (4 mL) . The 
reaction stirred at room temperature for 13 hrs. The 
mixture was diluted with ethyl acetate (100 mL) and water 
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(50 mL) and the layers were separated. The aqueous layer 
was extracted with additional ethyl acetate (2x 100 mL) • 
The combined organic layers were washed with water (50 mL) 
and dried (MgSO^) . Removal of the solvent afforded 109 mg 
5 of the benzoic acid (93% yield). 

A solution of the acid (76 mg, 0.188 mmol) in methylene 
chloride (2 mL) was treated with oxalylchloride (36 mg, 
0.28 mmol) and catalytic DMF. After stirring for 10 min 

10 at room temperature slow but steady evolution of gas was 
observed. Stirring continued for 2 hrs. when the solution 
was diluted with DCM (15 mL) and washed with saturated 
aqueous sodium hydrogencarbonate solution ( 5 mL) . The 
layers were separated. The organic layer was dried 

15 (Na^SO^) and the solvent evaporated affording the benzoyl 
chloride as pale yellow crystals. 

The benzoyl chloride was dissolved in methylene chloride 
(5 mL) and was added to a stirring solution of 

20 diazomethane in ether at -78*C. The cold bath was allowed 
to warm up and the mixture allowed to stir for 5 hrs at 
room temperature. The solvents and excess diazomethane 
were removed in vacuo and the residue purified by 
chromatography on silica gel using gradient elution method 

25 where the concentration of ethyl acetate ranged from 10% 
to 40 % in hexanes affording 48 mg of the diazo compound 
(60% yield) • 
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Compoxinds of the general formula: 



5 




10 



wherein ; 
15 n is 0-10 and 

Ar is pentachlorophenol , 2,4,6-trichlorophenol, 2,4,5- 

trichlorophenol , or 2 , 6-dichloro-4-f luorophenol 
are prepared as follows. 

20 Methyl vanillate (0.729 g, 4.0 nmole) , l-hydroxy-9- 
(2,3,4,5, 6-pentachlorophenoxy)nonane (1.634 g, 4.0 ninole) 
and triphenylphosphine (1.259 g, 4.8 nunole) were dissolved 
in 20 mL dry toluene under argon. DEAD (0.76 mL, 0.836 g, 
4.8 ininole) was added dropwise, and the mixture was stirred 

25 at 25 for one hour. The solution was concentrated to 
half volume and purified by flash chromatography eluting 
with DCM to give 1.0 g (1.7 mmole, 43%) of the product as 
a white crystalline solid. 

30 The methyl ester above (1.0 g, 1.7 mmole) was dissolved in 
50 mL THF, 2 mL water was added followed by lithium 
hydroxide (1.2 g, 50 mmole). The mixture was stirred at 
25 for one hour then refluxed for five hours. After 
cooling to 25 'C the mixture was poured onto ethyl acetate 

35 (200 mL) and the solution was washed with 1 M HCl (50 mL 
X3) then sat. aq. NaCl (Ix 50 mL) and dried over sodium 
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sulfate. The solvent was removed and the crude acid 
azeotroped once with toluene, 



The crude material above was dissolved in 100 mL toluene, 
5 10 mL (1.63 g, 14 mmole) thionyl chloride was added, and 
the mixture was refluxed for 90 min* The volume of the 
solution was reduced to approximately 30 mL by 
distillation, then the remaining toluene removed by 
evaporation. The crude acid chloride was dissolved in 20 

10 mL dry DCM and cooled to -78 under argon and a solution 
of approximately 10 mmole diazomethane in 50 mL anhydrous 
ether was added. The mixture was warmed to room 
temperature and stirred for 90 min. Argon was bubbled 
through the solution for 10 min. then the solvents were 

15 removed by evaporation and the crude material was purified 
by flash chromatography eluting with 10-20% ethyl acetate 
in hexane. The diazoketone (0.85 g, 1.4 mmole, 82% over 
three steps) was obtained as a pale yellow solid. 

20 The following identifiers have been prepared as described 
above : 

Photolabile Cleavage 

50 Identifiers were prepared of the formula: 

25 



30 



35 
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and n is 1,2,3,4,5,6,7,8,9, and 10. 

Oxidative Cleavage Type I 

7 Identifiers were prepared of the formula 




and n is 4,5,6,7,8,9, and 10. 

20 

Oxidative Cleavage Type II 

13 Identifiers were prepared of the formula 



25 




wherein: 
30 Ar is 



CI 



35 




CI 
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and n is 1,2,3,4,5,6,7,8,9,10; 
and wherein: 
3Ar is 



5 




CI 

and n is 0,3, and 9. 

10 



15 
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It is evident from the above description that the subject 
invention provides a versatile, simple method for 
identifying compounds, where the amount of compound 
5 present precludes any assurance of the ability to obtain 
an accurate determination of its reaction history. The 
method allows for the production of extraordinarily large 
numbers of different products, which can be used in 
various screening techniques to determine biological or 

10 other activity of interest. The use of tags which are 
chemically inert under the process conditions allows for 
great versatility in a variety of environments produced by 
the various synthetic techniques employed for producing 
the products of interest. The tags can be readily 

15 synthesized and permit accurate analysis, so as to 
accurately define the nature of the composition. 

All publications and patent applications cited in this 
specification are herein incorporated by reference as if 
20 each individual publication or patent application were 
specifically and individually indicated to be incorporated 
by reference. 

Although the foregoing invention has been described in 
25 some detail by way of illustration and example for 
purposes of clarity of understanding, it will be readily 
apparent to those of ordinary skill in the art in light of 
the teachings of .this invention that certain changes and 
modifications may be made thereto without departing from 
30 the spirit or scope of the appended claims. 
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WHAT IS CIAIMED IS; 

!• A method for recording the reaction history of a 
reaction series on each of a plurality of unique 
solid supports, wherein said reaction series involves 
5 at least two stages requiring differing agents or 

reaction conditions resulting in a different 
modification as to a plurality of said unique solid 
supports, resulting in a plurality of different final 
products on different unique solid supports, 
10 employing a combination of identifiers for recording 

said reaction history, said identifiers characterized 
by defining the choice of agent or reaction condition 
and the stage in said reaction series and being 
capable of being analyzed as to the choice and stage, 
15 said method comprising: 

reacting, at a first or intermediate stage of said 
series, a different agent or employing a different 
reaction condition with each of a group of said 
unique solid supports, said group comprising at least 
20 one of said unique solid supports, and a combination 

of identifiers wherein said combination of 
identifiers defines the choice of agent and the stage 
in said series as to each group of said unique solid 
supports, each of said identifiers being individually 
25 bound to said unique solid support directly or 

through other than a prior identifier? 
mixing said groups together and then dividing said 
plurality of unique solid supports into a plurality 
of groups for a second intermediate or final stage; 
30 and 

repeating said reacting at least once to provide a 
plurality of final products, having different 
products on the different individual unique solid 
supports • 

35 
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A method according to Claim 1, wherein at least lOO 
unique solid supports and at least 2 groups are 
employed in each said reacting. 

A method according to Claim 1, including the 
additional stages of screening said final products on 
said unique solid supports for a characteristic of 
interest; and identifying the reaction history of at 
least one final product having said characteristic of 
interest. 

A method of Claim 1 further comprising cleaving the 
final product from the solid support and screening 
said final product. 

A method of Claim 1 further comprising treating the 
identifiers so as to detach the tag components from 
the solid supports and reacting said tag components 
with a moiety capable of detection by fluorescence or 
electron capture. 

A method of Claim 5, wherein the detaching is done 
photochemically or oxidatively and the detectable 
moiety is derived from dansyl chloride or a 
polyhalobenzoylhalide. 

A method according to claim 5, wherein said tag 
components have two characteristics, a characteristic 
capable of separation and a characteristic capable of 
detection. 

A method according to Claim 7, wherein said 
characteristic capable of detection is the ability to 
be detected by electron capture. 
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9. A method according to Claim 7, wherein said 
characteristic capable of detection is the ability to 
be detected by mass spectroscopy. 

5 10- A method according to Claim 1, wherein said 
characteristic capable of detection is radioactivity. 

11. A method according to Claim 7, wherein said 
characteristic capable of detection is fluorescence. 

10 

12. A method according to Claim 7, wherein said tags may 
be separated by means of chromatography. 

13. A kit comprising a plurality of different separated 
15 organic compounds, each of the compounds 

characterized by having a distinguishable 
composition, encoding at least one bit of different 
information which can be determined by a physical 
measurement and sharing at least one common 
20 functionality- 

14. A Kit of Claim 13 comprising at least 4 different 
functional organic compounds. 

25 15. A kit according to Claim 13, wherein said functional 
organic compounds are of the formula: 

F^-F^-C-E-C' 

30 where F^-F^ is a linker which allows for attachment to 

and detachment from a solid particle? and 
C-E-C' is a tag which can be determined by a physical 
measurement. 



35 16. 



A kit according to Claim 15, wherein said functional 
organic compounds differ by the number of methylene 
groups and/or halogens, nitrogens or sulfurs present. 
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A kit according to Claim 15 wherein the C-E-C' 
portion can be removed photochemically. 

A kit according to Claim 15 wherein the C-E-c' 
portion can be removed oxidatively, hydrolytically, 
thermolytically, or reductively. 

A solid support characterized by having a ligand 
bound thereto and having a combination of identifiers 
bound to said solid support* 

A solid support according to Claim 19, wherein said 
ligand is an oligomer which is an oligopeptide, 
ol igonucleotide , ol igosaccharide , poly 1 ipid , 
polyester^ polyamide, polyurethane , polyurea, 
polyether, poly (phosphorus derivative) which is a 
phosphate , phosphonate , phosphoramide , 
phosphonamidey, phosphite, or phosphinamide , poly 
(sulfur derivative) which is a sulfone, sulfonate, 
sulfite, sulfonamide, or sulfenamide, where for the 
phosphorous and sulfur derivatives the indicated 
heteroatom for the most part will be bonded to C, H, 
N, O or S, and combinations thereof. 

A solid support according to claim 19 wherein said 
ligand is a non-oligomer which is heterocyclic, 
aromatic, alicyclic, or aliphatic, and combinations 
thereof . 

A solid support of Claim 21 wherein the non-oligomer 
is a diazabicyclic, an azatricyclic, or a branched 
amide compound. 

A solid support of Claim 19 wherein the ligand is 
linked to the support through a non-labile linkage. 
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24. A solid support of Claim 19 wherein the ligand is 
linked to the support through a cleavable linkage. 

25. A solid support according to Claim 19, wherein the 
5 identifier comprises tags, the tags being 

radioisotopes r or haloalkyl or haloarylallyl 
containing compounds. 

26. A solid support of Claim 19 which is a bead of about 
10 10-2000 Mia in diameter, and wherein the identifiers 

comprise tag components which after cleavage from the 
bead can be separated by gas chromatography and or 
liquid chromatography detected by electron capture, 
mass spectroscopy, fluorescence, or atomic emission 
15 techniques. 



27. A library comprising a plurality of solid supports 
according to claim 22. 

20 28. A library of Claim 27, wherein the final products 
have been cleaved from the solid support. 

29. A library of Claim 28, wherein the final products are 
a diazabicyclic, azatricyclic, or branched amide 

25 compounds. 

30. A process for identifying compounds having a 
characteristic of interest which comprises screening 
a library of Claim 27. 

30 

31. A process of Claim 30, wherein the compounds have 
been cleaved from the solid surface* 

32. A process of Claim 31, wherein the compound is a 
35 diazabicyclic, azatricyclic, or branched amide 

compound • 
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A method for producing a ligand involving a reaction 
series employing a method for recording the reaction 
history of a reaction series on each of a plurality 
of unique solid supports, wherein said reaction 
series involves at least two stages requiring 
differing agents and/or reaction conditions resulting 
in a different modification as to a plurality of said 
unique solid supports, resulting in a plurality of 
different final products on different unique solid 
supports, employing a combination of identifiers for 
recording said reaction history, said identifiers 
characterized by defining the choice of agent or 
reaction condition and the stage in said series and 
being capable of being analyzed as to the choice and 
stage, said method comprising: 

reacting, at a first or intermediate stage of said 
series, a different agent or employing a different 
reaction condition with each of a group of said 
unique solid supports, said group comprising at least 
one of said unicjue solid supports, and a combination 
of identifiers wherein said combination of 
identifiers defines the choice of agent and the stage 
in said series as to each group of said unique solid 
supports, each of said identifiers being individually 
bound to said unique solid support directly or 
through other than a prior identifier; 
mixing said groups together and then dividing said 
plurality of unique solid supports into a plurality 
of groups for a second intermediate or final stage; 
repeating said reacting at least once to provide a 
plurality of ligands, having different products on 
the different individual unique solid surfaces; and 
identifying said reaction history of at least one 
selected unique solid surface by means of said 
combination of identifiers* 
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A ligand according to Claim 33, wherein said 
identifying includes the stage of screening said 
ligands for a characteristic of interest. 

A method for producing a ligand involving a reaction 
series employing a method for recording the reaction 
history of a reaction series on each of a plurality 
of unique solid surfaces, wherein said reaction 
series involves at least two stages requiring 
differing agents and/or reaction conditions resulting 
in a different modification as to each of a plurality 
of said unique solid surfaces, resulting in a 
plurality of different ligands on different unique 
solid surfaces, employing combinations of identifiers 
for recording said reaction history, said combination 
of identifiers characterized by defining the choice 
of agent and/or reaction condition and the stage in 
said series and being capable of being analyzed as to 
the choice and stage, said method comprising: 
reacting, at a first or intermediate stage of said 
series, a different agent and/or employing a 
different reaction condition with each of a group of 
said unique solid surfaces, said group comprising at 
least one of said unique solid surfaces, and a 
combination of identifiers wherein said combination 
of identifiers defines the choice of agent and the 
stage in said series as to each group of said unique 
solid surfaces, each of said identifiers being 
individually bound to said unique solid surface 
through other than a prior identifier by a cleavable 
link? 

mixing said groups together and then dividing said 
plurality of unique solid surfaces into a plurality 
of groups for a second intermediate or final stages- 
repeating said reacting to provide a plurality of 
ligands having different ligands on the different 
individual unique solid surfaces; 
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10 



screening the ligands from a plurality of each of 
said unique solid surfaces for a characteristic of 
interest; and 

identifying said reaction history of at least one 
selected unique solid surface having ligand having 
said characteristic of interest by detaching the tag 
members from said unique solid surface and 
identifying said tag members by means of a differing 
characteristic . 



36, A method according to Claim 35, wherein said tags 
differ in an homologous series and are detected by 
electron capture gas chromatography or mass 
spectroscopy. 

15 

37 ♦ A compound of the Formula I: 

pUf2»C-E-C' I 
where F^-F^ is a linker which allows for attachment to 
and detachment from a support? and 
20 C-E-C' is the tag which is capable of analysis; 

E is a tag component which allows for detection, or 
allows for detection and provides for separation as 
a result of variable substitution; 

c and C' are tag components which allow for 
25 individual detection; 

is a linking component capable of being selectively 
cleaved to release the tag components; and 
F^ is a functional group which allows ready attachment 
of the compound to a synthesis support. 

30 
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38. A compound of Claim 37 having the formula: 

f1_f2_(C(E-C'),)^ 

wherein: 

is COjH, CHp:, NrV, C(0)R', oh, CHNj, SH, C(0)CHN2, 
5 S(02)C1, S(02)CHN2, Nj, NOj, NO, S{02)N3, OC(0)X, 

C(0)X, NCO, or NCS; 
f2 is ^_ 

H02 



.OB 

CHjA — 




-NC(0)0- CR=CRi-(CR'2b— , _cpU== cR^_c(r^ j^-. 



:0 R* 



-0 



_^Qj^A , —cfi' =xcfi— C(R IpA— . — [^I^ 
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— S— C(R )2 A— . — C(X)R^(r')2 a— . 



— C(OE)R^ C{r\k—. — C(0H)R— CrCHgXffi— , 

— C(0H)R^C(b')2— C(X)B— . — CCOHKCHgCHaX)— . 
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with the proviso that when is a bond, is OH or 
COOH; 

A is -0, -0C(0)0-, -0C(0)-, or -NHC(O)-; 
£ is a bond, C^-Cjq alkylene optionally substituted by 
5 1-40 

F, CI, Br, C,-C^ alkoxy, NR^R^, OR^, or NR^ or 
-[(C(R^)2)„-Y-Z-Y-(C(R^)2)„Y-Z-Y]p-; with the proviso 
that the maximum number of carbon atoms in C+C' is 
20; 

10 is H; F; CI; C^-C2o alkylene optionally substituted 

by 

1-40 F, CI, Br, C,-C^ alkoxy, NR^R^, OR*^, or NR"^, or 
-[(C{R*)2)„-Y-Z-Y-(C(R^)2)^y-Z-Y]p-; with the proviso 
that the maximum number of carbon atoms in c+c' is 
15 20; 

E is C^-C,o alkyl substituted by 1-20 F, Cl or Br; or 
Q-aryl 

wherein the aryl is substituted by 1-7 F, Cl, NOj, 
SOjR^, or substituted phenyl wherein the substituent 
20 is 1-5 F, Cl, NOj, or SOjR^; 

E-C' may be -H, -OH, or amino; 
R^ is H or C,-C^ alkyl; 

R^ is C=0, C(0)0, C(0)NR\ S, so, or SO2; 
R* is H or c,-C^ alkyl; 
25 r5 is C,-C^ alkyl; 

a is 1-5; 
b is 1-3; 

m and n is each 0-20; 
p is 1-7; 

30 Q is a bond, O, S, NR*, C«0, -C(0)NR^, -NR^C(O)-, - 

C(0)0-, 
or -0C(0)-; 

X is a leaving group such as Br, Cl, triflate, 
mesylate, 
35 tosylate, or 0C(0)0R^; 

y is a bond, O, S, or NR^; 
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Z is a bond; phenyl optionally substituted by 1-4 F, 
CI, Br, c^-c^ alkyl, <^r^6 alkoxy, C,-C^ alkyl 
substituted by 1-13 F, CI, or C^-C^ alkyloxy 
substituted by 1-13 F, CI, or Br; (C(R^) 2)^.20' °^ 
5 f ^^2) 1-20' with the proviso that when Z is a bond one 

of its adjacent Y's is also a bond and aryl is a 
mono- or bi-cyclic aromatic ring containing up to 10 
carbon atoms and up to 2 heteroatoms selected from O, 
S, and 

10 

39. A compound of Claim 38 wherein: 
is 



CO2H. OH. CHN2. C(0)CHN2. C(0)X. NCS, or CHgX; 




35 



C and C' is each independently C^-C2o alXylene 
unsubstituted or substituted by 1-40 F or CI, or [O- 
(CH2)2.3]p; 
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E is C^-C,Q alkyl substituted by 1-20 F or Cl; Q-aryl 
where aryl is a bi-cyclic aromatic ring substituted 
by 1-7 F or Cl; or Q-phenyl substituted by 1-5 F, Cl, 
NO^, or SOjR^; and 
5 Q is a bond, 0, -NR'C(O)-, or -OCtO)-. 



41. A compound of Claim 38 having the foirmula: 
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wherein Ar is pentafluoro- pentachloro-, or 
pentabromophenyl , 2,3,5, 6-tetraf luoro-4 (2,3,4,5,6- 

w pentaf luorophenyl ) phenyl , 2,4, 6-tr ichloropheny 1 , 

2,4, 5-trichlorophenyl , 2 , 6-dichloro-4-f luorophenyl , 

^ 5 or 2,3,5,6-tetrafluorophenyl, 

42. A compound of Claim 38 wherein; 
E-C' is H, OH, or NH^. 

10 43. A composition of the formula 

S-pi / -F^-C-E-C ' 

wherein: 

S is a soluble or solid support; 

C-E-C'is the tag which is capable of analysis where 
E is a tag component which (a) allows for detection, 
such as an electrophoric group wnich can be analyzed 
by gas chromatography or mass spectroscopy or (b) 
allows for detection and for separation as a result 
of variable substitution? 

C and C' are tag components which allow for 
distinguishing one tag from all other tags, usually 
allowing for separation as a result of variable 
length or substitution, for example, varying the 
chromatographic retention time or the mass 
spectroscopy ratio Z/e; 

is a linking component capable of being selectively 
cleaved to release the tag; and 

F^' is a functional group which provides for 
attachment to the support. 

44. A composition of claim 4 3 wherein: 
^ S is a capillary, hollow fiber, needle, solid fiber, 

cellulose bead, pore-glass bead, silica gel, 
polystyrene bead optionally cross-linked with 
divinylbenzene, grafted co-poly bead, poly-acrylamide 
bead, latex bead, dimethylacrylamide bead optionally 
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cross-linked with N,N'-bis-acryloyl ethylene diamine, 
glass particles coated with a hydrophobic polymer, or 
low molecular weight non-cross-l inked polystyrene; 
and 

F^'-F^-c-E-C' is the residue of Formula I attached to 
S. 

45. The method of Claim 1, wherein the combination of 
identifiers defines a binary coding scheme, 

46. The method of Claim 1, wherein the identifiers are of 
Claim 37. 

47. The method of Claim 1, wherein the identifiers are of 
Claim 38. 

48. The method of Claim 1, wherein the identifiers are of 
Claim 39. 

49. The method of Claim 1, wherein the identifiers are of 
Claim 42. 

50. The method of Claim 1 further comprising detaching 
the tag members from said unique solid surfaces. 

51. The method of Claim 50 wherein the tag members are 
detached photochemical ly, oxidatively, 
hydrolytically, theirmolytically, or reductively. 

52. The method of Claim 1 further comprising detaching 
non-oligomer ligands from said unique solid surfaces 
photochemical ly . 
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53. A compound of the formula 



CI 




0 R 



wherein: 

P is a polystyrene resin; 

IX^.^ is a plurality of residues of the formula 



CI 




0 CI 



wherein: 
n is 1 - 6; 

R is CH3, CH(CH3)2, CHjCO^H, (CH2}^li}i2, CHj-C^H^-OH, or 

is H, CH3, C2H5, CH^CH^CHj, or CH^C^U^. 

54. A method of synthesizing a chemical compound so that 
the structure of the compound is readily 
determinable, which comprises synthesizing the 
compound on the surface of a solid support under 
conditions such that the solid support at the 
completion of the synthesis of the compound has bound 
to it a plurality of identifiers which encode the 
reaction stages associated with the synthesis of the 
compound . 



I 
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55. A method of synthesizing a library of chem:^cal 
compounds so that the structure of each compound in 
the library is readily determinable which comprises 
synthesizing each compound on the surface of a unique 
solid support under conditions such that each such 
unique support at the completion of the synthesis of 
the library of compounds has bound to it a plurality 
of identifiers which encode the reaction stages 
associated with the synthesis of the compound 
synthesized on such solid support. 

56 • A method of determining the structure of a chemical 
compound which comprises synthesizing the compound by 
the method of claim 54 or 55, isolating the solid 
support upon which the compound was synthesized, 
treating the solid support so isolated so as to cause 
the tag components of each of the identifiers bound 
to the solid support to be released, determining the 
identity or quantity or both of each tag component so 
released, and deriving the structure of the compound 
from the identities or quantities or both of all such 
tag components. 

57. A method of identifying a compound having a desired 
characteristic which comprises synthesizing a library 
of chemical compounds by the method of claim 55, 
separately testing each of the compounds in the 
resulting library in an assay which identifies 
compounds having the desired characteristic so as to 
identify any compounds present in the library which 
has the desired characteristic. 

58. A method of claim 57, further comprising determining 
the structure of the compound so identified. 

59. A library of chemical compounds, each compound in the 
library being bound to a unique solid support and 
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each such solid support having bound to it a 
plurality of identifiers which encode the reaction 
stages associated with the synthesis of the compound 
bound to such solid support. 

> 

60. A library of claim 59, wherein compounds in the 
library are diazabicyclic compounds. 

61. A library of claim 59, wherein compounds in the 
library are azatricyclic compounds. 

62. A library of claim 59, wherein compounds in the 
library are branched amide compounds. 

63. A library of claim 59, wherein compound in the 
library are peptides. 

64. A method of identifying a compound having a desired 
characteristic which comprises testing a library of 
chemical compounds according to claim 58 in an assay 
which identifies compounds having the desired 
characteristic so as to identify any compound present 
in the library which have the desired characteristic. 

65. A method of claim 64, further comprising determining 
the structure of the compound so identified. 

66. A compound identified by the method of claim 63. 

67. A method of claim 64, wherein the desired 
i characteristic is antagonism for the human neurokinin 

1/brandykin receptor and the library of chemical 
\ compounds comprises azatricyclic compounds. 

68. A method of claim 64, wherein the desired 
characteristic in usefulness as a muscle relaxant, a 
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tranquilizer or a sedative and the library of 
chemical compounds comprising bezodiazopines. 

69. A method of claim 64, wherein the desired 
characteristic is useful in the treatment of 
hypertension or Raynaud's syndrome and the library of 
chemical compounds comprises branched amides. 
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